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2 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT 
val leu his his his gin asn pro glu phe 

62 

GAG CTG CAT GAA AAG CAC CAC CTG TTG CTC 
gin leu his glu lys his his leu leu leu 

122 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC 
ser lys gly ser thr lys lys arg asp val 

182 

CCC CTC CTG AAA GAG GGA AGG GTG GTG ACA 
pro leu leu lys asp gly arg val val thr 

242 

CTT CCT TCG GGG TAT CTT GGC TAC CAA GAG 
leu pro ser gly tyr leu gly tyr gin glu 

302 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG 
ile lys trp val asp gly gly lys pro leu 

362 

GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT 
val tyr thr gin asp gin his leu his asn 

422 

GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA 
gly ala gin ala leu gly asn glu leu val 

482 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT 
gly his val met ile ala phe leu pro thr 

542 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC 
arg ala thr gin glu glu val ala val asn 

602 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC 
gin cys his glu glu gly leu glu ser his 

662 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG 
ala glu pro tyr val ala ser glu tyr lys 



1 
A 

32 

TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
tyr asp glu ile lys ile glu leu pro thr 

92 

ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
thr phe phe his val ser cys asp asn ser 

152 

GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
val glu thr gin val gly tyr ser trp leu 

212 

AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
ser glu gin his ile pro val ser ala asn 

272 

CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu gly met gly arg his tyr gly pro glu 

332 

CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
leu lys ile ser thr his leu val ser thr 

392 

TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
phe phe gin tyr cys gin lys thr glu ser 

452 

AAG TAC CTT AAG AGT CTG CAT GCG ATG GAA 
lys tyr leu lys ser leu his ala met glu 

512 

ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
ile leu asn gin leu phe arg val leu thr 

572 

GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
val thr arg val ile ile his val val ala 

632 

TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
leu arg ser tyr val lys tyr ala tyr lys 

692 

ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
thr val his glu glu leu thr lys ser met 
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ACC ACQ ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
thr thr ile leu lys pro ser ala asp phe leu tlxr ser asn lys leu leu arg tyr ser 

782 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT AT6 GOT CAG CAT TTG ATA GAG AAC TCC AAA 
trp phe phe phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

842 Icadherln Cleavage | 872 

GTT AAG TTG CTG CGA AAC CAG A6A TTT CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

932 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA. TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu^ ala ser lys 

962 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
asn ala asn his ser leu ala val phe ile lys arg cys phe thr phe met asp arg gly 

1022 X052 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 

phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

1082 XH2 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCO 
leu phe glu tyr lys phe glu phe leu arg val val cys asn his glu his tyr xle pro 

1142 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

1232 |Cadherin EC 

i^C^AC TCA TTA ACA GAT GAG TTC TGC jicA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

^ GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

SI^AAC CTG CTG ATA AAG CAT TCT TTT GAT G^rAGA TAT GCT TCA AGG AGC CAT CAG GCA 
lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 

1382 1412/471 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 

arg ile ala thr leu tyr leu pro leu phe gly leu leu xle glu asn val gin arg xle 

X472 

ili^GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAO GAT GAA TCC 
^n Zl asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 
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-_n, 1532 

CTG OCT CTA CCA GCT 6TG AAT CCG CTG GTG ACG COG CAG AAG 6GA AGC ACC CTO GAC AAC 
III SI 11^ llo ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

CTO CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
tS asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 

1652 

icT CCA AAC ATC AAC A6T GTG AGA AAT GCT OAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
III p^ asn ile asn ser val arg asn ala asp ser arg gly ser leu xle ser thr asp 

1COO 1712 

icG GOT AAC AGC CTT CCA GAA AGG AAT AOT GAG AAG AGC AAT TCC CTG OAT AAG CAC CAA 

ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys hxs gin 

^\6T AGC ACA TTO GGA AAT TCC GTG GTT C^^GT GAT AAA CTT GAC CAG TCT GAG ATT 

Sn ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu xle 

X832 

^g\gC CTA CTG ATO TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT OAT GAT GCT TTG TTT 

Jys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

X892 

i^%AT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
tjr trp asn lys ala ser thr ser glu leu met asp phe phe thr xle ser glu val 

ilJ^CTG CAC CAG TTC CAG TAC ATG GGO AAG cl^TAC ATA GCC AGG AAC CAG GAG GGG TTG 
Ss gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

iona 2012 

^ CCC ATA OTT CAT OAT CGA AAG TCT CAG ACA TTO CCT GTT TCC CGT AAC AGA ACA GGA 
pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

oni-j 2072 

iSo ATO CAT GCC AGA TTO CAG CAG CTG GGC AGC CTO OAT AAC TCT CTC ACT TTT AAC CAC 

Zl met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn hxs 

A^%AT GGC CAC TCG GAC GCA GAT GTT CTG C^^CAO TCA TTA CTT GAA GCC AAC ATT OCT 
t« ^yr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn xle ala 

2X92 

ii"oAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
tSr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

2252 

lifcTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG ^ ^ °™ ^ ^ Zl 

gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 
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2282 2312 



TGT TTT CTT CAA AAA CAT CAO TCT GAA ACQ GOT TTA AAA AAT GTC TTC ACT GCC TTA AGO 
cys phe leu gin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

2372 

tcJ^TTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG AGA GCG OAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2402 2432 

CTG TGT TAG GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

5Ag2 2492 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

GTC%GG ACA CAT TT6 CAA GTC ATC ATA TCT JII^AGC CAG CTG ATA 6CA GAC GTT GTT OGC 

val arg thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

2 612 

mJ^GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

2672 

CTC^CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTO AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg He arg 

2732 

ic^GTG CTA ATG GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTO 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

JI"cTC CAG TAC AGC CTG GCC AAA TCC TAT GCC^GC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2852 I jatxxxxxxxxaciaaac Predicted 

cS GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG 6CA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

Transmembrane Domain xxxxxxxxxxxxxxxxxjaexMoacxx | 

tIt GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGO AAA GGC GTG TTT AGA CAA 

Zl Ss llx ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

297 2 

Ili^TGC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
S Jhr phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 

•*nn^ 3032 

GAC GTG GGG ATG CAG GAT GTC CAT TTC AAC GAG GAT GTG CTG ATG GAG CTC CTT GAG CAG 

val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 
^nc-j 3092 

^ GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
SI !ip leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 
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,1,, 3155 

ItC ATC CCC ATT TAT GAG AAG CGG AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ill pro lie tyr glu Xys arg arg asp phe phe glu asp glu asp gly lys glu tyr 

32X2 

m"tAC AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
tie tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys leu 

3272 

tI^TCG gat AAA TTT GGT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 

ser asp lys pHe gly ser glu asn val lys met ile gin asp ser gly lys val asn 

CC^AAG GAT CTG GAT TCT AAG TAT GCA TAC It^ CAG GTG ACT CAC GTC ATC CCC TTC ^T 
pro lys asp leu asp ser lys tyr ala tyr ile gin val thr bis val xle pro pHe pHe 

|^"gAA AAA GAG TTG CAA GAA AGG AAA ACA gII'tTT GAG AGA TCC CAC AAC ATC CGC CGC 

asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser hxs asn xle arg arg 

tJc ATG TTT GAG ATG CCA TTT ACG CAG ACC OGG AAG AGO CAG GGC GGG GTG GAA GAG CAG 

III Zl phe glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 

3512 

tS^^AAA CGG CGC ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cS tj^ 2^ arg thr ile leu thr ala ile his oys phe pro tyr val lys lys arg xle 

TC^GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pr! Ill ^ Sr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 

3602 XXX Coxled-Coil 1 xxxxxxKXXXXXX 3632 xxxx Coiled-Coil 1 xxxxxxxxxxxxx 
iG^^^AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
^11 lys val ala glu leu arg gin leu cys ser ser ala glu val asp met xle lys 

l^fc^r^^r^^^^^^r^^'AGC GTG AGT l^'cAG GTC AAT GCT GGC CCA CTA GCA TAT 

leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 

S%GA GCT TTC TTA GAT GAT ACA AAC ACA llf CGA TAT CCT GAC AAT AAA GTG AAG CTG 
ala arg ala phe leu asp asp thr asn thr lys arg tyr pro asp asn lys val lys leu 

CT^AAG GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
llu ijs glu val phe arg gin phe val glu ala cys gly gin ala leu ala val asn glu 



FIG. 1 (cont.) 



6/87 



- - ■■, rt ■> ^ooocxxxxxxxx 3872 xxxxxxxxxxxxxxxxxxMOCxMaaaacxxxxxx 

£=S^ sr^rl s - ^ - "1 s 

3992 

i^%.C r.. ceo TCC CT. «C .XC TTC OCC .TC .O. ^ JCT CC. .0. .CO .C. 

ser val leu pro asn ser leu his ile phe asn ala lie ser gxy v 

^^^^ PBM xxxx| 

mI gtt cac ogg atg acc agc tcg tct tcg gtc gtg tga tta cat ctc atg gcc cgt gtg 
met val his gly met thr ser ser ser ser val val STP 

GGA CTT GCT TTG TCA TTT oca AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 
AC^GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 

JcJ%AA CAG ACT TTC TAT AGG AGT TGT AAG li^GTG CAC ATA TTT TTT TAA ATC TCA CTG 

4292 

^\tA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 
ct"aQA TTT TAT TCT TCC TTG CAG AGT AGT Jct\gA ATA GAT GGC CTA CAG AAA AAA AAG 
GTT^TG GGA TCT ACA TGG CAG GGA GGG CTG J1"tGA CAT TGA TGC CTG GGG GAC CTT TTG 
4442 



JJt CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 
JI^'gTA TGT TGT ATG ACT AGG ATT TGT GCT ill\tC TCA TTC AAC AAC ATA GAG CAA GAA 

J^g'tGA GCT AAC TGA GCT AGA CAC TCA ATT ^t'cCG CTA CTG GCT TCA AGT CAG AAC TTT 

.^oo 4652 

GtI ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT ATT ACA TTT CTA CAT TTT TAA TAC 
.^oo 4712 

T^ CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 

4772 

ilf CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 



4802 
TTT ACT 
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1 

A 



2 32 



GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
val leu his his his gin asn pro glu phe tyr asp glu ile lys ile glu leu pro thr 



62 92 



CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC TTC CAT GTC AGO TGT GAC AAC TCA 

gin leu his glu lys his his leu leu leu thr phe phe his val ser cys asp asn ser 



122 152 



A6T AAA G6A AGO ACG AAG AAG AGO GAT GTC GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
ser lys gly ser thr lys lys arg asp val val glu thr gin val gly tyr ser trp leu 



182 212 



CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA AOC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
pro leu leu lys asp gly arg val val thr ser glu gin his ile pro val ser ala asn 



242 272 



CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 

leu pro ser gly tyr leu gly tyr gin glu leu gly met gly arg his tyr gly pro glu 



302 "2 



ATT AAA TGG GTA GAT OGA GGC AAG CCA CTG CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 

lys trp val asp gly gly lys pro leu leu lys ile ser thr his leu val ser thr 



362 392 



GTG TAT ACT CAG GAT CAG CAT TTA CAT AAT TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
val tyr thr gin asp gin his leu his asn phe phe gin tyr cys gin lys thr glu ser 



422 *52 



JgA GCC CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC CTT AAG AGT CTG P^'^ ^ ^ 

gly ala gin ala leu gly asn glu leu val lys tyr leu lys ser leu h3.s ala met glu 



482 



G^ CAC GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
Ss m met ile ala phe leu pro thr ile leu asn gin leu phe arg val leu thr 



542 572 



AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
Sa glu glu val ala val asn val thr arg val ile ile his val val ala 



602 "2 



^G TGC CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 

gin cys his glu glu gly leu glu ser his leu arg ser tyr val lys tyr ala tyr lys 

692 

^ GAG CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
SI Su lyl Jal ala ser glu tyr lys thr val his glu glu leu thr ays ser met 
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ACC ACQ ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAC TCA 
thr thr ile leu lys pro ser ala asp phe leu thr ser asn lys leu leu arg tyr ser 

ill TTT TTC TTT GAT GTA CTG ATC AAA TCT HI GCT CAO CAT TTG ATA GAG AAC TCC AAA 
IZIZ Se Phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

oAo Icadherin Cleavage! 872 

I" AAG TTG CTG CGA AAC CAG AGA. Tri CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
III lys leu leu arg asn gin arg phe pro ala ser tyr His His ala ala glu thr val 

932 

III AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 

992 

IS: GCO AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
^ «n his ser leu ala val phe ile lys arg cys phe thr phe met asp arg gly 

ilT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 

IZ val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

1 na-} m2 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG T6C AAC CAT GAA CAT TAT ATT CCG 
Si HI Pl^e glu phe leu arg val val cys asn his glu his tyr ile pro 

1-1 AO in2 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 

Zl «n Zu pro Phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

2L232 |Cadlierin EC 

S^TAC TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
lyl sS ™ asp glu Phe cys arg asn his phe leu val gly leu leu leu arg 

I 1292 
^'gTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

il^AAC CTG CTG ATA AAG CAT TCT TTT GAT Ia^AGA TAT GCT TCA AGG AGC CAT CAG GCA 
ijs «n leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser hxs gin ala 

ill^TA GCC ACC CTC TAC CTG CCT CTG TTT ll"cTG CTG ATT GAA AAC GTC CAG CGG ATC 
S ihr leu tyr leu pro leu phe gly leu leu ile glu asn val gin arg xle 

il^GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn val arg asp val ser pro phe pro val asn ala gly met thr val iys asp g 
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CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG CAG AAO OGA AGO ACC CTG GAC AAC 
III ala leu pro ala val asn pro leu val thr pro gin lys gly. ser thr leu asp asn 

i^^CTG CAC AAO GAC CTG CTG GGC GCC ATC ilc^GC ATT GCT TCT CCA TAT ACA ACC TCA 
sTr Ss «P leu leu gly ala ile ser gly ile ala ser pro tyr thr tlxr ser 

icfcCA AAC ATC AAC AGT GTG AGA AAT GCT gI^TCG AGA GGA TCT CTC ATA AGC ACA GAT 
thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu xle ser thr asp 

icG^GGT AAC AGC CTT CCA GAA AGG AAT AGT JIo^AAG AGC AAT TCC CTG GAT AAG CAC CAA 
ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys hxs gin 

^\gT AGC ACA TTO GGA AAT TCC GTG GTT C^^GT GAT AAA CTT GAC CAG TCT GAG ATT 
Si t« t« thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu xle 

1832 

il^AGC CTA CTG ATG TGT TTC CTC TAC ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
i^s ser leu leu met cys phe leu tyr ile leu lys ser «et ser asp asp ala leu phe 

1892 

i^%AT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 
Sr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

1952 I xxxxaooacHaauacxxaaaaac 

i^^TG CAC CAG TTC CAG TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 
oys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

1982 xxxxxxxxxx deleted in CLASP-2D(KIAA1058) xxxxxxxxxxxxmcxxxxxwqi | 

cJt gat CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

onA5 2072 

i^G ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn hxs 

A^%AT GGC CAC TCG GAC GCA GAT GTT CTG clc^AG TCA TTA CTT GAA GCC AAC ATT GCT 
sTr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn xle ala 

Deleted 

2192 I xxK 

icf GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG AAC 
til glu val oys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

2252 

^'cTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
III leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 
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23 X2 

IITtTT CTT CAA AAA cat CAG TCT GAA ACG OCT TTA AAA AAT GTC TTC ACT 6CC TTA AGG 
cys Se gin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

tc"tTA ATT TAT AAG TTT CCC TCA ACA TTC tIt^GAA GGG AGA GCG GAC ATG TGT GCG GCT 
ser lei tie tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

->/in-7 2432 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser He arg thr glu ala 

2492 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
III Sn leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

CTC%GG ACA CAT TTG CAA GTC ATC ATA TCT GT^AGC CAG CTG ATA GCA GAC GTT GTT GGC 
aS thr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

ocjo-) 2612 

GGG GAA ACC AGA TTC CAG CAG TCC CTC TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 

gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

C^%TT ATT AAG CAC ACC AGC TTC TCC TCT g1"gTG AAG GAC TTA ACC AAA AGG ATA CGC 
aS lys his thr ser phe ser ser asp val lys asp leu thr lys arg xle arg 

ic6%TG CTA ATG GCC ACC GCC CAG ATG AAG gI^CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

2792 

|I"cTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2 852 I xxxxxxxxxxxxxxx Predicted 

elf GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG ^ «^ 

leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met cys 

[Additional and differential exon usage found Hd^^''^''^ 
of 69 nucleotides. This entire se<iuence is found xn Human CLASP-2D 
<KIAA1058) and not other isoforms of CIASP-2. It has a se<iuence of: 
AAGCAGTCCAGTGGGAGCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCTGAGGAGGAGCCGGGGAG] 

Transmembrane Domain xxxxxxxjoocioQacKxxxxxxxxMac | W 

TAT GTC CAC GTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA dfeC GTG TTT AGA GAA 
Sr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

II^TGC ACC GCC TTC AGG GTC ATT ACC CCA ^c'aTC GAC GAG GAG GCC TCC ATG ATG OAA 
Sy eye thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 
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2S III III ^l III - IZ III III Zl ... Xeu Xeu .Xu ..Xn 

3062 xxxxxxxxxaoaaouacxxl m^^^rar CTC ATC GCC GAC ATC TAG AAA CTT 

S ^ S S° S S° - - - - - 

ACAGACAGTGAAACAGATGTGGAGGGATT] . 

S\tc ccc .tt »o «o coo ^ oxt J 'ZT.^lTr S= S° tj^ 

iXe iXe pro iXe tyr gXu Xys arg arg asp pUe phe gXu asp giu asp g y 

32X2 



3332 



rlib - I- ^ ^ ^ ^. tr. i- zi ^ 

3392 

i S L-? =S ^ S° Tkl S» S Zl IT. r« s» 

3512 

34 82 „ CCT TAT GTG AAG AAG CGC ATC 

^. ^, s ts =r= r~ 

TWO n»=1.0tia. a.lMlon (.t» 3586 ».d 35.7) «o«^ in CJJiSP-2C 

3572 |xxx| 
3542 ^„ ^TC GAG GTG GCC ATT GAC GAG ATG 

^^.^^.T^ S °~ - - ^- - 
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3602 

AGT AAG AAG GTG 
ser lys lys val 

3662 

CTG CAG CTC AAA 
leu gin leu lys 

3722 

GCG CGA GCT TTC 
ala arg ala plie 

3782 

CTT AAG GAA GTT 
leu lys glu val 

3842 

CGT CTG ATT AAA 
arg leu lie lys 



GCG GAG CTC CGG 
ala glu leu arg 



CTC CAG GGC AGC 
leu gin gly ser 

TTA GAT GAT ACA 
leu asp asp thr 

TTC AGG CAA TTT 
phe arg gin phe 

GAA GAC CAG CTC 
glu asp gin leu 



3632 
CAG CTG TGC TCC 
gin leu cys ser 

3692 
GTG AGT GTT CAG 
val ser val gin 

3752 

AAC ACA AAG CGA 
asn thr lys arg 

3812 

GTG GAA GCT TGC 
val glu ala cys 

3872 

GAG TAT CAG GAA 
glu tyr gin glu 



TCG GCC GAG GTG 
ser ala glu val 

GTC AAT GCT GGC 
val asn ala gly 



TAT CCT GAC AAT 
tyr pro asp asn 

GGT CAA GCC TTA 
gly gill a-^a- 



GAA ATG AAA GCC 
glu met lys ala 



GAC ATG ATC AAA 
asp met lie lys 

CCA CTA GCA TAT 
pro leu ala tyr 

AAA GTG AAG CTG 
lys val lys leu 

GCG GTA AAC GAA 
ala val asn glu 

AAC TAC AGG GAA 
asn tyr arg glu 



insertion of 8 nucleotides, found only in Human CIASP-2D with sequence. CTGGGATG 

AT 



3902 3932 



ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT GAG CAG \tC TGC CCC CTG GA6 GAG AAG AGO 
«;t ala lys glu leu ser glu ile met his glu gin ile cys pro leu glu glu lys thr 



i^'^GTC TTA CCG AAT TCC CTT CAC ATC TTC Jll^GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser thr 

An99 Ixxxx PBM xxxx| 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 

TGO GGA CTT 



GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 



4142 



ACC GAG CAC AGG GAG GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 



Jct^TAA CAG ACT TTC TAT AGO AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 



4232 



S^ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 



4292 



Jt"aGA TTT TAT TCT TCC TTG CAG AGT AGT CT^'aGA ATA GAT GGC CTA CAG AAA AAA AAG 



4352 



CTT^TG GGA TCT ACA TGG CAG GGA GGG CTG ^c"tGA CAT TGA TGC CTG GGG GAC CTT TTG 



4412 
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4442 



JcT CGA CTC GTG CCG GAA ATC TGA TCG TAA TCA OGO TAC AGA ACT TAC TAG TTT TGT CTA 
Jli GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 
Ji^ TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 
GTf ATT AAT CAT CGA CTC CGG GAC GGT CAT ^T^TGr ATT^ACA TTT CTA CAT TTT TAA TAC 
J^^'cAT GGG CTT ATG CAT TAA GTT TAA TTG J^^AA ATT TGT GCT GGT CCA GTA TAT GCA 



4772 



i^A CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 



4802 
TTT ACT 
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HC2A _ 

HC2-80 - " 1 

HC2B " I I 

HC2C — — — — — 

HC2D-KIAA1058 GCATCTGGAAATCTTGACAAAAATGCCAGATTTTCTGCCysLTCTACAGGCAAGACAGC^ 

HC2E "I I 

HC2F 

HC2A II'IIIII'IIIIIIIIIIIII" 

HC2-80 . I ZZJiZZZJi 

HC2B ~ _ " 1 3 mill 

HC2C -~ — "~ — .~ *" "~ 

HC2D-KIAA1058 aAGCTATCCAATGATGACATGCTCAAGTTACTTGCAGACTTTCGGAAACC^ 

HC2E I~~~"IIIIIZII I I 

HC2F '■" 



HC2A 

HC2-80 "~ " 1 

HC2B ~~ *" IIIIIII I I 

Sc2D-KIAAl058 GCTAAGCTCCCAGTGATTTTAGGCy^TCT^ 

HC2E 2 "III" 

HC2F 

HC2A II"IIII~I"IIIIIIIIII"- 

HC2-80 " " 'Ji I II 

HC2B I.IIIII 

Sc2D-KIAAl058 iTCCCTAlTTATGTTLvTTCATCATAC^^ 

HC2E zz~z mill II 

HC2F ' 

HC2A " " II II IIIIIIIIIII 

HC2-80 " I _ _I_IIIII 

HC2B ~ _ " I_I_III I 

Sc2D-KIAA1058 IcTCC^T^COTTi^GT^GGAATTT^^ 

HC2E ~ iiiii"immi„i I 

HC2F 
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HC2A _ 

HC2-80 _~ 

HC2B III". I I 

HC2C _ — — 

HC2D-KIAA1058 tACACCATCTACACCAATCACCTTTACGTTTATCCTAAGTACTTGAAATACG^ 

HC2E IIIIIIII IIIII- 

HC2F 

HC2A "IIIII'IIIIIIIIII"" 

HC2-80 IIIIIII I III" 

HC2B """I" I IIIIII 

HC2C ——————————————————— — — — — — — — 

HC2D-KIAA1058 aAGTCTTTTGCCAAGGCTAGAAATATTGCGATTTGCATTGAATTCAAA 

HC2E : " I I 

HC2F 

HC2A IIIIIIIII"""" 

HC2-80 " " 

HC2B ; "~_ 1 I 

HC2C — — — — — — — — — — — — — — — — — 

HC2D-KIAA1058 CAAGACTCTCy^CCCTTAAGTGCATTTATGGCAGACCTGGTGGGCCAGTTTTCA^ 

HC2E ZZ" ~I~"I""II"IIIII~~-I " 

HC2F 

HJ.2A AGTTTTACACCATCACCAAAACCCAGAATTTTATGATGAGATTAAA 

HC2-80 ""IIIIIIII 

HC2B IIIIII~II"II_I-I I I II— 

HC2C — — — — — — _ — — — — — — — — — — — 

HC2D-KIAA1058 aGCGCCTTTGCTGCAGTTTTACACCATCACCAAAACCCAGAATTTT^^ 

HC2E II'IIII II" I" I-I I-I-I 

HC2F 

HC2A ATAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTTGCTCy^CATTCTT^ 

HC2-80 ^ II~II~III"IIIIIIIIII 

HC2B ~ _ " 1 III" I 

HC2C — — 

HC2D-KIAA1058 aTAGAGTTGCCCACTCAGCTGCATGAAAAGCACCACCTGTTGCTC 

HC2E I"II"III IIII II 

HC2F ' 

J1C2A aGCTGTGACAACTCAAGTAAAGGAAGCACGAAGAAGAGGGATGTCGTTGA^ 

HC2-80 III~II"IIIIIIIIIIIIII 

HC2B ^_ " I_I_III_II 

Ilc2D-KIAAl058 IgCTCTGAcIaCTcIIgTAAAGGA^^ 

HC2E " 1 mil 

HC2F 
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GGCTACTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCACATC 

HC2-80 

HC2B "IIII I I IIIIII 

HC2C — ** ** ~ 

HC2D-KIAA1058 GGCTACTCCTGGCTTCCCCTCCTGAAAGACGGAAGGGTGGTGACAAGCGAGCAGCyVCATC 

HC2E mil III IIIIIIIIIIIIIII 

HC2F 

HC2A CCGGTCTCGGCGAACCTTCCTTCGGGCTATCTTGGCTACCAAGAGCTTGGGATGGGC^ 

HC2-80 IIIIIIIIIIIIIIIIIIIIIIII 

HC2B III^ IIIIIII 

!lc2D-KIAA1058 cCGGTcicGGCGAACCTTCCTTCGGGCTATCTTGGCTAC^ 

HC2E III~""II IIIIIII III 

HC2F 

HC2A CATTATGGTCCGGAAATTAAATGGGTAGATGGAGGCAAGCCACTGCTGAAAAM 

HC2-80 III"I~II""I IIIIIII 

HC2B " II_III_I I„I 

HC2D-KIAA1058 ^iilT^icC^^TTAAATGGGTAGATGGAGGCA^ . 

HC2E III~I~I"IIII~I III-II 

HC2F 

JJC2A CATCTGGTTTCTACAGTGTATACTCAGGATCAGCATTTACATAATTTTTTCCACT^ 

HC2-80 mil IIIIIIIII 

HC2B " III_I_I 

HC2D-KIAA1058 ^TCTGciTTCiACAGTCTATACTCAGGATCAGCATTTA^^ 

HC2E "I I 'lIIIIIIIIIIII I-I 

HC2F 

HC2A CAGAAAACCGAATCTGGAGCCCAAGCCTTAGGAAACGAACTTGTAAAGT^^ 

HC2-80 I"I~IIIIIIIIIII 

HC2B ■ II"II_II-I_I I_I_IIIIII 

HC2D-KIAA1058 CAG^^CC^ici^GCCCAAGCCTTAGGAAAC^ 

HC2E I"III~I~II mil I II 

HC2F 

HC2A CTGCATGCGATGGAAGGCCACGTGATGATCGCCTTCTTGCCCACTATCC^ 

Jc2B^° IIIIII^^t^^c^cgtgItgatcgccttcttgc^ 

lIc2D-KIAAl058 IVgIIVg^^^^^ 

Sc2E '"'^'^ gcgatggaaggccacgtgatgatcgccttcttgcccactatcct/^^ 

HC2F '■ 
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HC2A TTCCGAGTCCT(^CCAGAGCCACAa^GGAAGAAGTCGCGGTTAACGT6ACTCGGGTCAT^ 

HC2B^° TicCGAGTCCTcicC^G^GCCACACAGGAAGAAGTCGCGGTTJ^^ 

HC2D-KIAA1058 iTCC^OTCCT^C^^GCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTCy^^ 

HC2D KIAA1058 j^cCGAGTCCTCACCAGAGCCACACAGGAAGAAGTCGCGGTTAACGTGACTCGGGTC^^ 

HC2F 

JJC2A ATTCATGTGGTTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAGGTCATATCT 

HC2B^° ATi^TGT^TTGCCCAGTGCCATGAGGAAGGATTGGAGAGCCA^ 

Sc2D-KIAA1058 ATi^iGi^iiGCC^Gi^CATGAGGAAGGATTGGAGAGCCACTTGAGGTCa.TATGTT 

HC2D KIAA1058 Jii^jeieeTiecCCAGTGCCATGAGGAAGGATTGGAGAGCCACTTGAG^^ 

HC2F ' 

HC2A ju^GTACGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

HC2B^° ^GTACGCGTliilGGCTGAGCCATATGTTGCCTCTGAATACAAGAOVGTGCATGAAGA^ 

Sc2D-KIAA1058 ^gtIc^CTAtIIgGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTGCATGAAGAA 

HC2D KIAA1058 JJej;,cGCGTATAAGGCTGAGCCATATGTTGCCTCTGAATACAAGACAGTG^^ 

HC2F 

HC2A CTGACCAAATCCATGACCACGATTCTCy^GCCTTCTGCCGATTTCCTCACCAGCAA 

HC2B^° CTcIcCAllTCclTGACCAckTTCTCAAGCCTTCTGCCGA^^ 

Sc2D-KIAA1058 CTGAC^TC^T^C^CGATTCTCAAGCCTTCTGCCGATTTCCTCACCAGCAACAAA 

HC2D KIAA1058 ^js^cCAAATCCATGACCACGATTCTCAAGCCTTCTGCCGATTTCCTC^^ 

HC2F 

HC2A cTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGCATTTC 

HC2B^° CTACT^GGiAci^T^iiTTTciiTGATGTACTGATCA^ 

I!c2D-KIAA1058 cilcT^GTAci^i^TiTiTCTiTGATGTACTGATCAAATCTATGGCTCA^ 

HC2D KIAA1058 cTACTGAGGTACTCATGGTTTTTCTTTGATGTACTGATCAAATCTATGGCTCAGC^^ 

HC2F 

HC2A ATAGAGAACTCC^AAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATCCTM 

HC2B^° ATA^^CTC^GTT^GTT^TGCGAAACCAGAGATTTCC^ 

IIc2D-KIAA1058 AiA^^cic^GiillGTi^TGCGAAACCAGAGATTTCCTGCATCCTATCAT^^ 

HC2D KIAA1058 ^iJ^^y^cTCCAAAGTTAAGTTGCTGCGAAACCAGAGATTTCCTGCATC^ 

HC2F 
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HC2A GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

jip2— 80 — — — — — — — 

HC2B GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

HC2C 

HC2D-KIAA1058 gcagtggaaaccgttgtaaatatgctgatgccacacatcactcagaagtttcgagataat 

HC2E GCAGCGGAAACCGTTGTAAATATGCTGATGCCACACATCACTCAGAAGTTTGGAGATAAT 

HC2F 

HC2A CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2 — 80 " 

HC2B CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2C — — — — — — — 

HC2D-KIAA1058 cCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2E CCAGAGGCATCTAAGAACGCGAATCATAGCCTTGCTGTCTTCATCAAGAGATGTTTCACC 

HC2F 

HC2A TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

•T/-iQ_QQ — — — — — — *" — — — 

HC2B TTCATGGACAGGGGCTTTGTCTTCAAGCAGATCAACAACTACATTAGCTGTTTTGCTCCT 

HC2C — ——— ——— — — — — — — 

HC2D-KIAA1058 ttcatggacaggggctttgtcttcaagcagatcaacaactacattagctgttttgctcct 

HC2E ttcatggacaggggctttgtcttcaagcagatcaacaactacattagctgttttgctcct 

HC2F 

HC2A ggagacccaaagaccctctttgaatacaagtttgaatttctccgtgtagtgtgcaaccat 

. upo— 80 ———————————————————— — —"~—~~*"~~"~""~"*"~ 

HC2B ggagacccaaagaccctctttgaatacaagtttgaatttctccgtgtagtgtgcaaccat 

HC2C — ~ — — — — — — — — — — — — — ————— — — 

HC2D-KIAA1058 gGAGACCCAAAGACCCTCTTTGAATACAAGTTTGAATTTCTCCGTGTAGTGTGCAACCAT 

HC2E ggagacccaaagaccctctttgaatacaagtttgratttctccgtgtagtgtg^ 

HC2F 

HC2A gaacattatattccgttgaacttaccaatgccatttggaaaaggcaggattcaaagatac 

2 80 — — — — — — — 

HC2B gaacattatattccgttgaacttaco^tgccatttggaaaaggcaggattcaaagat^ 

HC2C — — — ■ — — — — — ——— — ——— — — — — 

HC2D-KIAA1058 gAACATTATATTCCGTTGAACTTACCAATGCCATTTGGAAAAGGCAGGATTCAAAGATAC ■ 

HC2E gaacattatattccgttgaacttaccaatgccatttggaaaaggcaggattcaaagatac 

HC2F 

caagacctccagcttgactactcattaacagatgagttctgcagaaaccacttcttggtg 
„"_RQ tccagcttgactactcattaacagatgagttctgcagaaaccacttcttggtg 

HC2B CAAGACCTCCy^GCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTT^ 

HC2D-KIAA1058 ^^cCTCcI^Ti^CTlcTCATTAACAGATGAGTTCTGCAGAAACCACTTCTTG^^^ 

HC2D KIAA1058 jy^(^cCTCCAGCTTGACTACTCATTAACAGATGAGTTCTGCAGAAACCACTTCTT^ 

HC2F 
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HC2A GGACTGTTACTGAGGGAGGTGGGGACA6CCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 

HC2-80 GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTT.CCGGGAGGTCCGTCTGATC 

IJC2B GGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 

HC2C ~"" — — — — "* 

HC2D-KIAA1058 gGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 

HC2E gGACTGTTACTGAGGGAGGTGGGGACAGCCCTCCAGGAGTTCCGGGAGGTCCGTCTGATC 

HC2F 

HC2A gccatcagtgtgctcaagaacctgctgataaagcattcttttgatgacagatatgcttca 

HC2-80 GCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGAT6ACAGATATGCTTCA 

HC2B GCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 

HC2C ~ ~" 

HC2D-KIAA1058 gCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 

HC2E gCCATCAGTGTGCTCAAGAACCTGCTGATAAAGCATTCTTTTGATGACAGATATGCTTCA 

HC2F 

HC2A aggagccatcaggcaaggatagccaccctctacctgcctctgtttggtctgctgattgaa 

HC2-80 aggagccatcaggcaaggatagccaccctctacctgcctctgtttggtctgctgattgaa 

HC2B aggagccatcaggcaaggatagccaccctctacctgcctctgtttgg^ctgctgattgaa 

HC2C! *~ ~* 

HC2D-KIAA1058 aGGAGCCATCAGGCAAGGATAGCCACCCTCTACCTGCCTCTGTTTGGTCTGCTGATTGAA 

HC2E aggagccatcaggcaaggatagccaccctctacctgcctctgtttggtc^ 

HC2F : 

aacgtccagcggatcaatgtgagggatgtgtcacccttccctgtgaacgcgggcatgacc 

HC2-80 aacgtccagcggatcaatgtgagggatgtgtcacccttccctgtgaacgcgggcatgacc 

HC2B aacgtccagcggatcaatgtgagggatgtgtcacccttccctgtgaacgcgggcatgacc 

HC2C "~ — — — — ~ 

HC2D-KIAA1058 aACGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGCATGACT 

HC2D KIAA1058 ^cGTCCAGCGGATCAATGTGAGGGATGTGTCACCCTTCCCTGTGAACGCGGGC^^ 

HC2F 

GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAAGGGA 

„C2-aO gtgaaggatgaatccctggctctaccagctgtgaatccgctggtgacgccgcagaaggga 

HC2B gtgaaggatgaatccctggctctaccagctgtgaatccgctggtgacgccgcagaaggga 

5c2D-KIAA1058 GT^^i^TCCCTGGCTCTACCAGCTGTGAATCCGCTCGTGACGCCGCA^ 

5c2E GTGAAGGATGAATCCCTGGCTCTACCAGCTGTGAATCCGCTGGTGACGCCGCAGAA^ 

HC2P 

AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 

HC2-80 AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 

AGCACCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTG^ 

HC2D-KIAA1058 AGcIcCCT^^^^ci^A^GGACCTGCTGGGCGCCATCTCCGGCATTGCTTCT 

HC2D KIAA1058 JJ^^cCCTGGACAACAGCCTGCACAAGGACCTGCTGGGCGCCATCTCCGGCATTGC 

HC2P 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 

CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
CCATATACAACCTCAACTCCAAACATCAACAGTGTGAGAAATGCTGATTCGAGAGGATCT 
GCTGATTCGAGAGGATCT 

CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 

CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 
CTCATAAGCACAGATTCGGGTAACAGCCTTCCAGAAAGGAATAGTGAGAAGAGCAATTCC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCAGATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 

CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 
CTGGATAAGCACCAACAAAGTAGCACATTGGGAAATTCCGTGGTTCGCTGTGATAAACTT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTC 

GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 
GACCAGTCTGAGATTAAGAGCCTACTGATGTGTTTCCTCTACATCTTAAAGAGCATGTCT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 

GATGATGCTTTGTTTACATATTGGAACAAGGCTTC^ 

GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 
GATGATGCTTTGTTTACATATTGGAACAAGGCTTCAACATCTGAACTTATGGATTTTTTT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 
ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 
ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 

ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTAC^^ 

ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAGG 
ACAATATCTGAAGTCTGCCTGCACCAGTTCCAGTACATGGGGAAGCGATACATAGCCAG- 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 



AACCAGGAGGGGTTGGGACCCATAGTTCATGATCGAAAGTCTCAGACATTGCCTGTTTCC 
TGTGA GAAAG ATATCAAGTGT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 
CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 
CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

CAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 

CGTAACAGAACAGGAATGATGCATGCCAGATTGCAGCAGCTGGGCAGCCTGGATAACTCT 
GCTTGGAA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 

CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
CTCACTTTTAACCACAGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 
-TTTCTGTAGACAATGGCTATGGCCACTCGGACGCAGATGTTCTGCACCAGTCATTACTT 



HC2A 

HC2-80 

HC2B 

HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 

GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 
GAAGCCAACATTGCTACTGAGGTTTGCCTGACAGCTCTGGACACGCTTTCTCTATTTACA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAG CTCCTGGCC6ACCATGGACATAATCCTCTCATGAAAAAAGTT 

TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAGTT 
TTGGCGTTTAAGAACCAGCTCCTGGCCGACCATGGACATAATCCTCTCATGAAAAAAAAA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 

TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
TTTGATGTCTACCTGTGTTTTCTTCAAAAACATCAGTCTGAAACGGCTTTAAAAAATGTC 
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TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

HC2 -80 rjtrjicACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

jj^^2B TTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

HC2C ' 

HC2D-KIAA1058 ttCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

tTCACTGCCTTAAGGTCCTTAATTTATAAGTTTCCCTCAACATTCTATGAAGGGAGAGCG 

HC2F 

HC2A gacatgtgtgcggctctgtgttacgagattctcaagtgctgtaactccaagctgagctcc 

HC2 - 8 0 gacatgtgtgcggctctgtgttacgagattctcaagtgctgtaactccaagctgagctcc 

HC2B gacatgtgtgcggctctgtgttacgagattctcaagtgctgtaactccaagctgagctcc 

HC2C *" " 

HC2D-KIAA1058 gACATGTGTGCGGCTCTGTGTTACGAGATTCTCAAGTGCTGTAACTCCAAGCTGAGCTCC 

JJC2E gacatgtgtgcggctctgtgttacgagattctcaagtgctgtaactccaagctgagctcc 

HC2F 

HC2A atcaggacggaggcctcccagctgctctacttcctgatgaggaacaactttgattacact 

HC2 - 80 atcaggacggaggcctcccagctgctctacttcctgatgaggaacaactttgattacact 

HC2B atcaggacggaggcctcccagctgctctacttcctgatgaggaacaactttgattacact 

HC2C "~ — ~- — -* — — — ~"~ "~ 

HC2D-KIAA1058 aTCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTXGATTACACT 

JJC2E aTCAGGACGGAGGCCTCCCAGCTGCTCTACTTCCTGATGAGGAACAACTTTGATTACACT 

HC2F 

HC2A ggaaagaagtcctttgtccggacacatttgcaagtcatcatatctgtcagccagctgata 
HC2 -80 ggaaagaagtcctttgtccggacacatttgcaagtcatgatatctgtcagccagctgata 
HC2B ggaaagaagtcctttgtccggacacatttgcaagtcatcatatctgtcagccagctgata 

HC2C 

HC2D-KIAA1058 gGAAAGAAGTCCTTTGTCCGGACACATTTGCAAGTCATCATATCTGTCAGCCAGCTGATA 

HC2E ggaaagaagtcctttgtccggacacatttgcaagtcatcatatctgtcagccac^ 

HC2F 

HC2A gcagacgttgttggcattggggaaaccagattccagcagtccctgtccatcatcaacaac 
HC2 - 8 0 gcagacgttgttggcattggggaaaccagattccagcagtccctgtccatcatcaacaac 
HC2B gcagacgttgttggcattggggaaaccagattccagcagtccctgtccatgatcaacaac 

HC2C ——— ———— 

HC2D -KIAAIO 5 8 gCAGACGTTGTTGGCATTGGGGGAACCAGATTCCAGCAGTCCCTGTCCATCATCAACAAC 
HC2E GCyVGACGTTGTTGGCATTGGGGAAACCAGATTCCAGCAGTCCCTGTCCJ^TCATCAACA^ 

HC2F 

HC2A TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
HC2-80 TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
HC2B TGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
HC2C ——————— 

HC2D-KIAA1058 tGTGCCAACAGTGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 
HC2E TGTGCCAACA6TGACCGGCTTATTAAGCACACCAGCTTCTCCTCTGATGTGAAGGACTTA 

HC2F 



FIG. 3A (cont.) 



24/87 



HC2A ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2-80 ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2B ACCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2C *~ ~ — — — 

HC2D-KIAA1058 aCCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2E aCCAAAAGGATACGCACGGTGCTAATGGCCACCGCCCAGATGAAGGAGCATGAGAACGAC 

HC2F 

„r.2A ccagagatgctggtggacctccagtacagcctggccaaatcctatgccagcacgcccgag 

HC2-80 ccagagatgctggtggacctccagtacagcctggccaaatcctatgccagcacgcccgag 

HC2B ccagagatgctggtggacctccagtacagcctggccaaatcctatgccagcacgcccgag 

HC2C *~ 

HC2D-KIAA1058 cCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

CCAGAGATGCTGGTGGACCTCCAGTACAGCCTGGCCAAATCCTATGCCAGCACGCCCGAG 

HC2F 

„P2, CTCAGGAA6ACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

"c2 -80 CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

gC2B CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

HP2C! — ~* *~ — *" 

HC2D-KIAA1058 CTCAGGAAGACGTGGCTCGACAGCATGGCCAGGATCCATGTCAAAAATGGCGATCTCTCA 

CTCAGGAAGACGTGGCTCGACAGCATGGCCy^GGATCCATGTCAAAAATGGCGATCTCT^ 

HC2F 

GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTG6CAGAATATCTCACACGGAAA 

HC2-80 GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2B GAGGCAGCAATGTGCTATGTCCACGTAACAGCCCTA6TGGCAGAATATCTCACACGGAAA 

Sc2D-KIAA1058 ^^^^^TGTGCTATGTCCACGTAACAGCCCTAGTGGCAGAATATCTCACACGGAAA 

HC2E GAGGOWXa^TGTGCTATGTCCACGTAACAGCCCTAGTGGCy^GAATATCTaVCACGGA^ 

HC2F 

HC2A G I IIIIII 

HC2-80 G ~_ ~ """" ~~ " 

HC2B G ^ I 

Slc2D-KIAA1058 ^^OTC^Gi^GCCGCCCCTTCTCCCCCACAGCCATAGCGCCTGCCT^ 

HC2E G IIIIII 

HC2F 

- GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

„^,_nn GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

l^lvT GCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

„^," GTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2S-KIAAIO58 AGCCGGGGAGGCGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2D KIAA1058 AW-L. qcGTGTTTAGACAAGGATGCACCGCCTTCAGGGTCATTACCCCAAACATC 

HC2F 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 
GACGAGGAGGCCTCCATGATGGAAGACGTGGGGATGCAGGATGTCCATTTCAACGAGGAT 

GACGAGGAGGCCTCCATGATGGAAGACGTGGGGA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
GTGCTGATGGAGCTCCTTGAGCAGTGCGCAGATGGACTCTGGAAAGCCGAGCGCTACGAG 
AAGCCGAGCGCTACGAG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTT 

CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTT 

CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 
CTCATCGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 
CTCATTGCCGACATCTACAAACTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 
CTCATCGCCGACATCTAC^CTTATCATCCCCATTTATGAGAAGCGGAGGGATTTTGAG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 
AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 
AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 
AGGCTGGCCCATCTGTATGACACGCTGCACCGGGCCTACAGCAAAGTGACCGAGGTCATG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 

CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGGCAGCG 
CACTCGGGCCGCAGGCTTCTGGGGACCTACTTCCGGGTAGCCTTCTTCGGGCAGG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTTTGAAGATGAAGATGGA 

CTTTGAAGATGAAGATGGA 

" 1 GATTCTTTGAAGATGAAGATGGA 

" GATTCTTTGAAGATGAAGATGGA 

CAATACCAGTTTACAGACAGTGAAACAGATGTGGAGGGATTCTTTGAAGATGAAGATGGA 
GATTCTTTGAAGATGAAGATGGA 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 
AAGGAGTATATTTACAAGGAACCCAAACTCACACCGCTGTCGGAAATTTCTCAGAGACTC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGACACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 
CTTAAACTGTACTCGGATAAATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCATACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGCCTACATCCAGGTGACTCACGTCATC 
AAGGTCAACCCTAAGGATCTGGATTCTAAGTATGdATACATCCAGGTGACTCACGTCATC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCCTTCTTTGACGAAAAAGAGTXGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 
CCCTTCTTTGACGAAAAAGAGTTGCAAGAAAGGAAAACAGAGTTTGAGAGATCCCACAAC 



HC2A 
HC2-B0 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 
ATCCGCCGCTTCATGTTTGAGATGCCATTTACGCAGACCGGGAAGAGGCAGGGCGGGGTG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCy^TACACTGCTTCCCTTATGTG^ 

GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 

GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 

GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 

GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 

GAAGAGCAGTGCAAACGGCGCACCATCCTGACAGCCATACACTGCTTCCCTTATGTGAAG 



FIG. 3A (cont) 



27/87 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTTTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGT-'CCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 
AAGCGCATCCCTGTCATGTACCAGCACCACACTGACCTGAACCCCATCGAGGTGGCCATT 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 
GACGAGATGAGTAAGAAGGTGGCGGAGCTCCGGCAGCTGTGCTCCTCGGCCGAGGTGGAC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 
ATGATCAAACTGCAGCTCAAACTCCAGGGCAGCGTGAGTGTTCAGGTCAATGCTGGCCCA 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCATATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATAAA 
CTAGCJ^TATGCGCGAGCTTTCTTAGATGATACAAACACAAAGCGATATCCTGACAATA^ 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 
GTGAAGCTGCTTAAGGAAGTTTTCAGGCAATTTGTGGAAGCTTGCGGTCAAGCCTTAGCG 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
GTAAACGAACGTCTGATTAAAGAAGACCAGCTCGAGTATCAGGAAGAAATGAAAGCCAAC 
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HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 

TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAGCTGGGATGATCTGCC 
TACAGGGAAATGGCGAAGGAGCTTTCTGAAATCATGCATGAGCAG ATCTGCC 



HC2A 
HC2-80 
HC2B 
HC2C 

HC2D-KIAA1058 

HC2E 

HC2F 



CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 
CCCTGGAGGAGAAGACGAGCGTCTTACCGAATTCCCTTCACATCTTCAACGCCATCAGTG 



HC2A GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 
HC2-80 ' GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 

HC2B GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGA 

HC2C " GGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGA 

HC2D-KIAA1058 gGACTCCAACAAGCACAATGGTTCACGGGATGACCAGCTCGTCTTCGGTCGTGTGATTAC 

HC2E ggactccaacaagcacaatggttcacgggatgaccagctcgtcttcggtcgtgtga---- 

HC2F 

HC2A atctcatggcccgtgtgtggggacttgctttgtcatttgcaaactcaggatgctttccaa 

JJC2-80 atctcatggcccgtgtgtggggacttgctttgtcatttgcaaactcaggatgctttccaa 

HC2B "IIII'IIIIIIIIIIIIII---" --------- 

HC2C 

HC2D-KIAA1058 aTCTCATGGCCCGTGTGTGGGGACTTGCTTTGTCATTTGCAAACTCAGGATGCTTTCCAA 

HC2E IIII""ZI~IIIIIIIIII""I I- 

HC2F 

HC2A aGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCAAGGGGAAGGGGAGAGAAAGGAAA 

HC2~80 aGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCAAGGGGAAGGGGAGAGAAAGGAAA 

HC2B " ~ IIIIIIIIIIIIIIIII 

HC2C ' — — """" 

HC2D-KIAA1058 aGCCAATCACTGGGGAGACCGAGCACAGGGAGGACCA-GGGGAAGGGGAGAGAAAGGAAA 

HC2E IIIIIIIIIIIII"- 

HC2F 

HC2A taaagaacaacgttatttcttaacagactttctataggagttgtaagaaggtgcacatat 

HC2 -8 0 taaagaacaacgttatttcttaacagactttctataggagttgtaagaaggtgcacatat 

HC2B IIIIIIIIIIIIIIIIIIII 

HC2C ~" 

HC2D-KIAA1058 tAAAGAACAACGTTATTTCTTAACAGACTTTCTATAGGAGTTGTAAGAAGGTGC^^ 

HC2E """IIIIIIII" 

HC2F 
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J.C2A TTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGTGGTA 

HC2-80 TTTTTTAAATCTCACTGGCAATATTCyVAAGTTTTCATTGTGTCTTAACAAAGGTGTGG 

HC2B 

HC2C — ~ " 

HC2D-KIAA1058 tTTTTTAAATCTCACTGGCAATATTCAAAGTTTTCATTGTGTCTTAACAAAGGTGT^ 

HC2E . IIIIIIIII'IIIIII"" I I 

HC2F 

GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 

HC2-80 GACACTCTTGAGCTGGACTTAGATTTTATTCTTCCTTGCAGAGTAGTGTTAGAATAGATG 

HC2B 

HC2C ~ 

HC2D-KIAAi058 gacactcttgagctggacttagattttattcttccttgcagagtagtgttagaatagatg 

HC2E • IIIII~IZIII 

HC2F 

gcctacagaaaaaaaaggttctgggatctacatggcagggagggctgcactgacattgat 

HC2-80 gcctacagaaaaaaaaggttctgggatctacatggcagggagggctgcactgacattgat 

HC2B 

HC2C ~ — 

HC2D-KIAA1058 gCCTACAGAAAAAAAAGGTTCTGGGATCTACATGGCAGGGAGGGCTGCACTGACATT^ 

HC2E II IIIIIIIIIIIIIIIII I_I 

HC2F 

HC2A gcctgggggaccttttgcctcgactcgtgccggaaatctgatcgtaatcagggtacagaa 

HC2 - 8 0 gcctgggggaccttttgcctcgactcgtgccggaaatctgatcgtaatcagggtacagaa 

HC2B IIIIIIIIIIIIIIIIIIIIIIIIIIII----- 

HC2C ~ — — — • 

HC2D-KIAA1058 GCCTGGGGGACCTTTTGCCTCGAGGCTGAGCTGGAAAATCTTGAAAATATTTTTT----T 

HC2E . ~ "IIIIIIIIIIIIIIIIIIIII 

HC2F ' 

„C2A cTTACTAGTTTTGTCTAGGAGTATGTTGTATGACTAGGATTTGTGCTATTATCTCATTCA 

HC2-80 CTTACTAGTTTTGTCTAGGAGTATGTTGTATGACTAGGATTTGTGCTATTATCTCATTCA 

HC2B ^IIIIIIIIIIIIIIIIIIIIIIIII 

HC2C — — *~ ~ 

HC2D-KIAA1058 tTTCCTGTGGCACATTCAGGTTGAATACAAGAACTATTTTTGTGACTAGTTTTTGATGAC 

HC2E ■ II""IIII"IIIII~IIIIIIIIII--" 

HC2F 

HC2A acaacatagagcaagaatagtgagctaactgagctagacactcaattaatccgctactgg 

HC2-80 acaacatagagcaagaatagtgagctaactgagctagacactcaattaa^ 

HC2B II"IIIIIIIIIIIIIII I 

HC2C 

HC2D-KIAA1058 cTAAGGGAACTGACCATTGTAATTTTTGTACCAGTGAACCAGGAGATTT^^ 

HC2E II~II""I"II II 

HC2F 
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CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

- 8 0 CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

HC2B 

HC2C " " 

HC2D-KIAA1058 aTTCATTTCCTTGCATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACT 

HC2E 

HC2F 

„C2A ttctacatttttaatactcacatgggcttatgcattaagtttaattgtgataaatttgtg 

HC2-80 TTCTACATTTTTAATACTCy^CATGGGCTTATGCATTAAGTTTAATTGTGATAAATTTGTC 

HC2B 

HC2C " — — — 

HC2D-KIAA1058 aGTCAAGCAGTTTAGAACCAAAGGCCTATATTAATAACCGCAACTATGCTGAAAAGTAC^ 

HC2E ""IIIIIIIII IIIIIIIIIII-I 

HC2F 

HC2A ctggtccagtatatgcaatacactttaatggtttattcttgtcataaaaatgtgcaatat 

HC2 - 8 0 ctggtccagtatatgcaatacactttaatggtttattcttgtcataaaaatgtgcaatat 

HC2B 

HC2C ~ *~ — — — 

HC2D-KIAA1058 aAGTAGTACAGTATATTGTTATGTACATATCATTGTTAATACAGTCCTGGCATTCTGTAC 

HC2E " I- 

HC2F 

HC2 A GGAGATGTATACAAGTCTTTACT- ; III" 

HC2 -80 GGAGATGTATACAAGTCTTTACT IIIIII 

HC2B """""IIIIIII""""---- 

HC2C ~ 

HC2D-KIAA1058 aTATATGTATTACATTTCTACATTTTTAATACTCACATGGGCTTATGCATTAAGTTTAAT 

HC2E ■ IIIIII""II~IIIII"II--- I- 

HC2F 



HC2A 

HC2-80 " """"" ~ 

HC2B IIIIII" _I I " III 

HC2C "* ~* 

HC2D-KIAA1058 tGTGATAAATTTGTGCTGTTCCAGTATATGCAATACACTTTAATGTTTTATTCTTGTACA 

HC2E IIIIIII~III~I~IIIIIIIIIIII~-~ 

HC2F . 



HC2A III__ 

HC2-80 _ "IIIII" " I I 

HC2B mill" I I 

HC2C ~* ~ *~ 

HC2D-KIAA1058 tAAAAATGTGCAATATGGAGATGTATACAGTCTTTACTATATTAGGTTTATAAACA^ 

HC2E I~II~II~II"IIII~II"IIII II 

HC2F 
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HC2A = 

HC2-80 mil 

HC2B 

HC2C *^ 

HC2D-KIAA1 058 tAAGAATTTCATCCTTTTGCCAAAATGGTGGAGTATGTAATTGGTAAATCATAAATCCTG 

HC2E IIII'IIIIIIIIIIII' 11" 

HC2F 

HC2A IIIII'IIIIIIIIIIIII" 

HC2-80 \ ~ 

HC2B ~ " 1_ 

HC2C — 

HC2D-KIAA1058 TGGTGAATGGTGGTGTACTTTAAAGCTGTCACCATGTTATATTTTCTTTTAAGACATTAA 

HC2E 

HC2F . 

HC2A ZZZZ 

HC2-80 

HC2B " ~ 

HC2C *" — — — — — 

HC2D-KIAA1058 tTTAGTAATTTTATATTTGGGAAAATAAAGGTTTTTAATTTTATTTAACTGGAATCACTG 

jj^2E IIIIIIIIIII 

HC2F 

HC2A IIIII 

HC2-80 '1 

HC2B ~ ~ ^ 

HC2C — ' *" ~" 

HC2D-KIAA10 5 8 CCCTGCTGTAATTAAACATTCTGTACCACATCTGTATTAAAAAGACATTGCTGACC 

HC2E 

HC2F 
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CTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTATTACAT 

HC2-80 cTTCAAGTCAGAACTTTGTCATTAATCATCGACTCCGGGACGGTCATATATGTAT^ 

HC2B "Illlllllllllllllllim 

HC2C * *" "* " 

HC2D-KIAA1058 aTTCATTTCCTTGCATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAA 

HC2E "IIIIIIIIIIIII-II 

HC2F 



HC2A ttctacatttttaatactcacatgggcttatgcattaagtttaattgtgataaatttgtg 

HC2-80 ttctac^tttttaatactcacatgggcttatgcattaagtttaattgtgata^ 

HC2B IIIIIIIIIIIIIII 

HC2C — "" — — — — ——— ——— — 

HC2D-KIAA1058 aGTCAAGCAGTTTAGAACCAAAGGCCTATATTAATAACCGCAACTATGCTC 

HC2E . • III" I-II 

HC2F 

HC2A ctggtccagtatatgcaatacactttaatggtttattcttgtcataaaaatgtgcaatat 

HC2 -80 ctggtccagtatatgcaatacactttaatggtttattcttgtcataaaaatgt^ 

HC2B ■ IIII'IIIIIIIIIII 

HC2C — — — — — — ~ 

HC2D-KIAA1058 aAGTAGTACAGTATATTGTTATGTACATATCATTGTTAATACAGTCCTGGCATTC 

HC2E mill I I- 

HC2F 

HC2A GGAGATGTATACAAGTCTTTACT IIIIII 

HC 2 - 8 0 GGAGATGT ATACAAGTC T TTACT III" 

HC2B II"IIIIIIIIIIIIIIIII I 

HC2C — — ———— — — — *" ■* 

HC2D-KIAA1058 aTATATGTATTACATTTCTACJVTTTTTAATACTCACATGGGCTTATGCATTAAOT 

HC2E . II'IIIIIII III~IIIII II 

HC2F : 

HC2A IIIIIIIIIIII""IIIIIIIIIII 

HC2-80 I_IIIIIIII 

HC2B I~I_I_I II 

HC2C — — ———— — ——— — — —— ———— 

HC2D-KIAA1058 tGTGATAAATTTGTGCTGTTCCAGTATATGCAATACACTTTAATGTTTTAT^ 

HC2E IIII"II IIIIIIIIIII I-I 

HC2F 

HC2A " ~ IIIII""IIIIIIIIIII 

HC2-80 " ~ II_.IIIIIII 

HC2B "*"" II_III_I_I I_I 

HC2C — ~" *" — — •— 

HC2D-KIAA1058 tAAAAATGTGCAATATGGAGATGTATACAGTCTTTACTATATTAGGTTTATA^ 

HC2E II"IIII"""IIIIII-I I-I 

HC2F 
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HC2A 

HC2-80 _II_IIIII_r 

HC2B "~ " 

HC2D-KIAA1058 '^Z^lllV^^^^^ 

HC2E IIIIIIIIIIIIIII-"" 

HC2F 

HC2A mill IIIIIIIIIIIII""- 

HC2-80 " 

HC2B ~~ 

HC2D-KIAA1058 i^i^i^T^TCTACTTTAAAGCTGTCACCATGT^ 

HC2E ■ - 

HC2F 

HC2A Illlllllllliri"" 

HC2-80 I 

HC2B I 

Sd-KIAA1058 ii^AOT^TTiilTAiiT^GAAAATAAAGGTTTT^ 

HC2E IIIIIIIIIIIIIIII I 

HC2F 

HC2A IIIIII 

HC2-80 ~ 

HC2B ' I I 

Ilc2D-KIAAl058 cCCTGCTGi^TTAAACATTCTGTACCACATCTG^^^ 
HC2E 

HC2F 
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HC2A 

HC2A-80 = 

HC2B ' 

HC2C — — — — — — 

HC2D ASG^^LDKNARFSAIYRQDSNKLSNDDMLKLIlADFRKPEKMAKLPVILGN^ 

HC2E 

HC2F 

HC2A 

HC2A-80 

HC2B IIIIIII 

HC2C 

HC2D FPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQPYTIYTNHLyVYPKYLKYDSQ 

HC2E ZZZZ I 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 
HC2D 
HC2E 
HC2F 



-VLHHHQNPEFYDEIK 



KSFAKARNIAICIEFKDSDEEDSQPLKCIYGRPGGPVFTRSAFAAVLHHHQNPEFTOEIK 



HC2A lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPIJ^GRVVTSEQHI 

HC2A-80 ■ ^~ 21" 

HC2B I 

HC2C • — — *" " ———— — — 

HC2D lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPIJiKDGRVW 

HC2E ~~ """II 

HC2F ' 

HC2A pvSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPIiLKISTHLVSTVYTQDQHLHNFFQYC 

HC2A-80 2 II I I II II 

HC2B IIII~II~I_ I II_I-I I 

HC2C —————— — — ~ 

HC2D PVSANLPSGYLGYQELC3VIGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQ 

HC2E IIIIIII I IIIIIIII 

HC2F ' 

HC2A QKTESGAQALGNEI.VKYLKSLHAMEGHVMIAFLPTIIiNQLFRVLTRATQEEVA^^^ 

HC2A'-80 *" ————— ——— — — ——————— — 

jj^2B AMEGHVMIAFLPTII.NQLFRVLTRATQEEVAVNVTRVI 

HC2C "~ — ————— — — 

HC2D QKTESGAQALGlsraLVKYLKSLHAMEGHVMIAFLPTII^QLFRVLTRATQEEVAVN^ 
^^2E AMEGHVMIAFLPTILNQLFRVLTRATQEEVAVNVTRVI 

HC2F ' 
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HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



IHWAQCHEEGIiESHLRSYVBCYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFIiTSNK 



IHWAQCHEEGIiESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 
IHWAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSNK 



LLRYSWFFFDVLIKSMAQHLIENSKVKLIJINQRFPASYHHAAETVV^^ 



LmYSWFFFDVLIKSMAQHLIENSKVKLIJU^QRFPASYHHAAETVVNMI^ 



LLKYSWFFDVI.IKSMAQHLIENSKVKIJ^QRFPASYHHAVETVV^^ 
LLRYSWFFFDVLIKSMAQHLIENSKVKLUOTQRFPASYHHAAETV^^ 



PEASKNANHSIAWIKRCFTFMDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLRW 



PEASKNANHSIxAWIKRCFTFMDRGFVFKQIimYISCFAPGDPKTLFEYKFEE^ 



PEASKNANHSIAVFIKRCFTFbdDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLR^ 
PEASKNANHSIJWFIKRCFTFMDRGFVFKQINNYISCFAPGDPKTLFEYKFEFLRWCra 



EHYIPLNLPMPFGKGRIQRYQDLQIJDYSLTDEFCRNHFLVGLLIJ^VGTALQEFREV^ 

QUDYSIiTDEFCRNHFLVGLLLREVGTALQEFREVRLI 

EHYIPI^PMPFGKGRIQRYQDMIJDYSLTDEFCRNHFI.VGLIJ^VGTALQEFRE^ 

EHYIPIJt^IiPMPFGKGRIQRYQDLQIX>YSLTDEFCRNHFLVGLIi^ 

EHYIPLNLPMPFGKGRIQRYQDLQIJ:>YSLTDEFCRNHFLVGLIiREVGTALQEFREVR^ 



AISVIJOTIXIKHSFDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAG^ 
AISVLKOTJ:.iKHSFDDRYASRSHQARIATLYLPLFGIJ-IEWQRINVRDVSPFPVNAGMT 
AISVLKNLLIKHSroDRYASRSHQARIATLYLPLFGIilENVQRINVRDVSPFFV^ 

AISVLKNLLIKHSFDDRYASRSHQARIATLYLPLFGLLIENVQRINVRDVSPFPVNAGMT 
AISVLKOTJaIKHSFDDRYASRSHQARIATLYLPI.FGIJ:,IENVQRINVRDVSPFPV^ 



VKDESIJ^PAWPLVTPQKGSTIXJNSLHKDLLGAISGIASPYTTSTPNINSVKNA^ 

VKDESLALPAVNPLVTPQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRGS 

VKDESIJ^PAVNPLVTPQKGSTIJDNSIJIKDIiGAISGIASPYTTSTPNINS^ 

\nCDESIJU.PAVNPLWPQKGSTIJDNSIJIKDLLGAISGIASPYT 

VKDESIJ^PAWPLWPQKGSTLDNSLHKDLLGAISGIASPYTTSTPNINSVRNADSRG 

ADSRGS 
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HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



LI STDSGNSLPEPNSEKSNSLDKHQQS S TLGNSWRCDKLDQSE I KSLLMCFLY ILKSMS 
LISTDSGNSIiPEPNSEKSNSIiDKHQQSSTLGNSVNmCDKLDQSEIKSIiliMCFIiYILKSMS 
LISTDSGNSLPERKSEKSNSIiDKHQQSSTLGNSVN^CDKLDQSEIKSLLMCFLYILKSMS 

LISTDSGNSLPERNSEKSNSIiDKHQQSSTLGNSVVRCDKLDQSEIKSLLMCFLyiLKSMS 
LISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSVVRCDKLDQSEIKSLLMCFLYILKSMS 
LI STDSGNSLPERNSEKSNSLDKHQQS STLGNSVVRCDKIJ)QSE IKSLLMCFLYILKSMS 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



DDALFTYWNKASTSELbdDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTL^ 

DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 

DDALFTYWNKASTSELbdDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 

DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAR 

DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIARNQEGLGPIVHDRKSQTLPVS 
DDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAS VR~KISSVLGIS 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C * 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



RNRTQ4MHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
I^TCa^MHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
R]^TGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 

TGMMHARLQQLGSIJDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 

RNRTGMMHARLQQLGSLDNSLTFNHSYGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 
yr D_NG YGHSDADVLHQSLLEANIATEVCLTALDTLSLFT 

lAFKNQLIJ^HGHNPIMOCN^VYLCFLQKHQSETALKN^ 

LAFKNQLIJUDHGHNPI24KKV^VYLCFLQKHQSETALKNVFTALRSLIYKFP 

LAFK--LIJ^HGHNPLMKKSmDVyLCFLQKHQSETALKNVFTALRSLIY^^ 

lAFKNQLIJ^HGHNPLMKKS^VYLCFLQKHQSETALK^ 
IJ^KNQLLADHGHNPLMKK\^V!tLCFLQKHQSETALKNVFT^ 
LAFKNQLLADHGHNPLMKKK 

DMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLI 
DMCAALCYEILKCCNSKLSSIRTEASQLLYFIMRimFDYTGKKSFVRTHLQVTISVSQLI 
DMCAALCYEILKCCarSKLSSIRTEASQLLYFIMO^NFDYTGKKSF^ 

DMCAALCYEILKCCNSKLSSIRTEASQLLYFIiMRNNFDYTGKKSFVRTHI^VI 
DMCAALCYEILKCCNSKLSSIRTEASQLLYFIMRIsnm)yTG 

ADWGIGETRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRXRTVLMATAQMKEHE^ 

ADWGIGETRFQQSLSIIimCANSDRLIKHTSFSSDVKDLTKRIRTVI^TAQMK^ 

ADWGIGETRFQQSLSIIimCANSDRLIKHTSFSSDVKDLTKRIRTVIMATA^^ 

ADWGIGGTRFQQSLS I INNCANSDRLIKHTSFS SDVKDLTKRIRTVLMATAQ^4KEHEND 
ADWGIGETRFQQSLSIIimCANSDRLIKHTSFSSDVKDLTKRIRTVI^TAQMKEHEOT 
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HC2A pEMLVDLQYSIAKSYASTPELI^TWIJDSMARIHVKNGDLSELWU^ 

HC2A--80 pEMLVDLQYSIJ^SYASTPELRKTWIJ^SMARIHVKNGDLSEAAMCYVHVTAI^^^ 

HC2B pEMLVDLQYSIJ5a<SYASTPELRKTWIJ5SMARIHVKNGDLSEAAMCYVHV^ 

HC2C 

HC2D pEMLVDLQYSIJ^SYASTPELRKTWLDSMARIHVKNGDLSEAAMCYVHVTALVAEYL^ 

HC2E pEblLVDLQYSIJ^SYASTPEIJlKTWIJDSMARIHN^GDLSEAAMCYVHVT^ 

HC2F 



HC2A 

HC2A-80 

HC2B 

HC2C 

HC2D 

HC2E 

HC2F 



GVFRQGCTAFRVITPNIDEEASMMEDVGMQDVHFNE 

GVFRQGCTAFRVITPNIDEEASMbdEDVGMQDVHFNE 

GVFRQGCTAFRVITPNIDEEASMMEDVQ4QDVHFNE 

FRQGC T AFRVI T PN I DEE ASMMED VGMQDVHFNE 

EAVQWEPPLLPHSHSACI^SRGGVTOQGCTAFRVITPNIDEEASMblEDVGMQDVHF^ 
GVFRQGCTAFRVITPNIDEEASMMEDVG 



HC2A DV3JMELLEQCADGI.WKAERYELIADIYKLIIPIYEKRR 

HC2 A-8 0 DVX^IiLEQCADGLWKAERYELIAD I YKLI I PI YEKRR 

HC2B DVI^LIiEQCADGLWKAERYELIADIYKLIIPIYEKRRDFERLAHLYDTLHRAYSK 

HC2C DVLMELLEQCADGLWKAERYELIADIYKLIIPIYEKE^FERIJ^LYDT^^ 

HC2D DVI^LLEQCADGLWKAERYELIADIYKLI IPIYEI^LRDE^R^ 

HC2E 

HC2F 



KAERYELIADIYKLIIPIYEKRRDFERLAHIiYDTIiHRAYSK 



-DFFEDEDGKEYIYKEPKLTPLSE 
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1 



2 32 

GTT TTA CAC CAT CAC CAA AAC CCA GAA TTT TAT GAT GAG ATT AAA ATA GAG TTG CCC ACT 
val leu his his his gin asn pro glu phe tyr asp glu ile lys ile glu leu pro thr 

62 92 

CAG CTG CAT GAA AAG CAC CAC CTG TTG CTC ACA TTC TTC CAT GTC AGC TGT GAC AAC TCA 
gin leu his glu lys his his leu leu leu thr phe phe his val ser cys asp asn ser 

122 152 

AGT AAA GGA AGC ACG AAG AAG AGG GAT GTC GTT GAA ACC CAA GTT GGC TAC TCC TGG CTT 
ser lys gly ser thr lys lys arg asp val val glu thr gin val gly tyr ser trp leu 

132 212 

CCC CTC CTG AAA GAC GGA AGG GTG GTG ACA AGC GAG CAG CAC ATC CCG GTC TCG GCG AAC 
pro leu leu lys asp gly arg val val thr ser glu gin his ile pro val ser ala asn 

242 272 

CTT CCT TCG GGC TAT CTT GGC TAC CAA GAG CTT GGG ATG GGC AGG CAT TAT GGT CCG GAA 
leu pro ser gly tyr leu gly tyr gin glu leu gly met gly arg his tyr gly pro glu 

302 332 

ATT AAA TGG GTA GAT GGA GGC AAG CCA CTG CTG AAA ATT TCC ACT CAT CTG GTT TCT ACA 
ile lys trp val asp gly gly lys pro leu leu lys ile ser thr his leu val ser thr 
n ref 1.1, 1.2 and 1.3 

■ 362 11 

GTG TAT ACT CA<S-^GAT CAG CAT TTA CAT AAT TTT TTC CAG TAC TGT CAG AAA ACC GAA TCT 
val tyr thr gin asp gin his leu his asn phe phe gin tyr cys gin lys thr glu ser 



422 



452 



GGA GCC CAA GCC TTA GGA AAC GAA CTT GTA AAG TAC CTT AAG AGT CTG CAT GCG MG GAA 
gly ala gin ala leu gly asn glu leu val lys tyr leu lys ser leu his ala met glu 

482 512 

GGC CAC GTG ATG ATC GCC TTC TTG CCC ACT ATC CTA AAC CAG CTG TTC CGA GTC CTC ACC 
gly his val met ile ala phe leu pro thr ile leu asn gin leu phe arg val leu thr 

542 572 

AGA GCC ACA CAG GAA GAA GTC GCG GTT AAC GTG ACT CGG GTC ATT ATT CAT GTG GTT GCC 
arg ala thr gin glu glu val ala val asn val thr arg val ile ile hxs val val ala 

602 ^32 

CAG TGC CAT GAG GAA GGA TTG GAG AGC CAC TTG AGG TCA TAT GTT AAG TAC GCG TAT AAG 
gin cys his glu glu gly leu glu ser his leu arg ser tyr val lys tyr ala tyr lys 

662 

GCT GAG CCA TAT GTT GCC TCT GAA TAC AAG ACA GTG CAT GAA GAA CTG ACC AAA TCC ATG 
ala glu pro tyr val ala ser glu tyr lys thr val his glu glu leu thr lys ser met 
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722 752 

ACC ACG ATT CTC AAG CCT TCT GCC GAT TTC CTC ACC AGC AAC AAA CTA CTG AGG TAG TCA 
thr thr ilo leu lys pro ser ala asp plxe leu thr ser asn lys. leu leu arg tyr ser 

782 812 

TGG TTT TTC TTT GAT GTA CTG ATC AAA TCT ATG GCT CAG CAT TTG ATA GAG AAC TCC AAA 
trp phe phe phe asp val leu ile lys ser met ala gin his leu ile glu asn ser lys 

842 ICadherin Cleavage | 872 

GTT AAG TTG CTG CGA AAC CAG AGA TTT CCT GCA TCC TAT CAT CAT GCA GCG GAA ACC GTT 
val lys leu leu arg asn gin arg phe pro ala ser tyr his his ala ala glu thr val 

902 

GTA AAT ATG CTG ATG CCA CAC ATC ACT CAG AAG TTT GGA GAT AAT CCA GAG GCA TCT AAG 
val asn met leu met pro his ile thr gin lys phe gly asp asn pro glu ala ser lys 



962 

AAC GCG AAT CAT AGC CTT GCT GTC TTC ATC AAG AGA TGT TTC ACC TTC ATG GAC AGG GGC 
asn ala asn his ser leu ala val phe ile lys arg cys phe thr phe met asp arg gly 

ref 2.r| 

1022 1052 \} 

TTT GTC TTC AAG CAG ATC AAC AAC TAC ATT AGC TGT TTT GCT CCT GGA GAC CCA AAG ACC 

phe val phe lys gin ile asn asn tyr ile ser cys phe ala pro gly asp pro lys thr 

1082 1112 

CTC TTT GAA TAC AAG TTT GAA TTT CTC CGT GTA GTG TGC AAC CAT GAA CAT TAT ATT CCG 
leu phe glu tyr lys phe glu phe leu arg val val cys asn his glu his tyr ile pro 

1142 

TTG AAC TTA CCA ATG CCA TTT GGA AAA GGC AGG ATT CAA AGA TAC CAA GAC CTC CAG CTT 
leu asn leu pro met pro phe gly lys gly arg ile gin arg tyr gin asp leu gin leu 

1202 1232 

GAC TAC TCA TTA ACA GAT GAG TTC TGC AGA AAC CAC TTC TTG GTG GGA CTG TTA CTG AGG 
asp tyr ser leu thr asp glu phe cys arg asn his phe leu val gly leu leu leu arg 

1262 1292 

GAG GTG GGG ACA GCC CTC CAG GAG TTC CGG GAG GTC CGT CTG ATC GCC ATC AGT GTG CTC 
glu val gly thr ala leu gin glu phe arg glu val arg leu ile ala ile ser val leu 

n ref 3 . 1 

1322 1352 JL 

AAG AAC CTG CTG ATA AAG CAT TCT TTT GAT GAC AGA TAT GCT TCA AGG AGC CAT CAG GCA 
lys asn leu leu ile lys his ser phe asp asp arg tyr ala ser arg ser his gin ala 

1382 1412/471 

AGG ATA GCC ACC CTC TAC CTG CCT CTG TTT GGT CTG CTG ATT GAA AAC GTC CAG CGG ATC 
arg ile ala thr leu tyr leu pro leu phe gly leu leu ile glu asn val gin arg ile 

1442 l*"'^ 

AAT GTG AGG GAT GTG TCA CCC TTC CCT GTG AAC GCG GGC ATG ACC GTG AAG GAT GAA TCC 
asn val arg asp val ser pro phe pro val asn ala gly met thr val lys asp glu ser 
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1502 X532 

CTG GCT CTA CCA GCT GTG AAT CCG CTG GTG ACG CCG GAG AAG GGA AGO ACC CTG GAG AAC 
leu ala leu pro ala val asn pro leu val thr pro gin lys gly ser thr leu asp asn 

n re£ 4 . 1 and 4 . 2 
1562 1592 jL 

AGC CTG CAC AAG GAC CTG CTG GGC GCC ATC TCC GGC ATT GCT TCT CCA TAT ACA ACC TCA 
ser leu his lys asp leu leu gly ala ile ser gly ile ala ser pro tyr thr thr ser 

1622 1652 

ACT CCA AAC ATC AAC AGT GTG AGA AAT GCT' GAT TCG AGA GGA TCT CTC ATA AGC ACA GAT 
thr pro asn ile asn ser val arg asn ala asp ser arg gly ser leu ile ser thr asp 

ref 5.1 suid 5.2 

1682 "12 

TCG GGT AAC AGC CTT CCA GAA AGO AAT AGT GAG AAG AGC AAT TCC CTG GAT AAG CAC CAA 
ser gly asn ser leu pro glu arg asn ser glu lys ser asn ser leu asp lys his gin 

1742 1772 

CAA ACT AGC ACA TTG GGA AAT TCC GTG GTT CGC TGT GAT AAA CTT GAC CAG TCT GAG ATT 
gin ser ser thr leu gly asn ser val val arg cys asp lys leu asp gin ser glu ile 

1802 * 1832 

AAG AGC CTA CTG ATG TGT TTC CTC TAG ATC TTA AAG AGC ATG TCT GAT GAT GCT TTG TTT 
Xys ser leu leu met cys phe leu tyr ile leu lys ser met ser asp asp ala leu phe 

1862 1892 

ACA TAT TGG AAC AAG GCT TCA ACA TCT GAA CTT ATG GAT TTT TTT ACA ATA TCT GAA GTC 

thr tyr trp asn lys ala ser thr ser glu leu met asp phe phe thr ile ser glu val 

n ref 6.1 

1922 Jl 1952 

TGC CTG CAC CAG TTC ^G TAC ATG GGG AAG CGA TAC ATA GCC AGG AAC CAG GAG GGG TTG 

cys leu his gin phe gin tyr met gly lys arg tyr ile ala arg asn gin glu gly leu 

1982 2012 

GGA CCC ATA GTT CAT GAT CGA AAG TCT CAG ACA TTG CCT GTT TCC CGT AAC AGA ACA GGA 
gly pro ile val his asp arg lys ser gin thr leu pro val ser arg asn arg thr gly 

2042 2072 

ATG ATG CAT GCC AGA TTG CAG CAG CTG GGC AGC CTG GAT AAC TCT CTC ACT TTT AAC CAC 
met met his ala arg leu gin gin leu gly ser leu asp asn ser leu thr phe asn his 

2102 2132 

AGC TAT GGC CAC TCG GAC GCA GAT GTT CTG CAC CAG TCA TTA CTT GAA GCC AAC ATT GCT 
ser tyr gly his ser asp ala asp val leu his gin ser leu leu glu ala asn ile ala 



ref 7.1 



2162 2192 

ACT GAG GTT TGC CTG ACA GCT CTG GAC ACG CTT TCT CTA TTT ACA TTG GCG TTT AAG XAC 
thr glu val cys leu thr ala leu asp thr leu ser leu phe thr leu ala phe lys asn 

2222 2252 

CAG CTC CTG GCC GAC CAT GGA CAT AAT CCT CTC ATG AAA AAA GTT TTT GAT GTC TAC CTG 
gin leu leu ala asp his gly his asn pro leu met lys lys val phe asp val tyr leu 

2282 2312 
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TGT TTT CTT CAA AAA CAT CAG TCT GAA ACG GCT TTA AAA AAT GTC TTC ACT 6CC TTA AGG 
oys pHe leu sin lys his gin ser glu thr ala leu lys asn val phe thr ala leu arg 

2372 

Tc"tTA ATT TAT AAG TTT CCC TCA ACA TTC TAT GAA GGG A6A GCG GAC ATG TGT GCG GCT 
ser leu ile tyr lys phe pro ser thr phe tyr glu gly arg ala asp met cys ala ala 

2402 2432 

CTG TGT TAC GAG ATT CTC AAG TGC TGT AAC TCC AAG CTG AGC TCC ATC AGG ACG GAG GCC 
leu cys tyr glu ile leu lys cys cys asn ser lys leu ser ser ile arg thr glu ala 

2462 2492 

TCC CAG CTG CTC TAC TTC CTG ATG AGG AAC AAC TTT GAT TAC ACT GGA AAG AAG TCC TTT 
ser gin leu leu tyr phe leu met arg asn asn phe asp tyr thr gly lys lys ser phe 

2522 2552 

GTC CGG ACA CAT TTG CAA GTC ATC ATA TCT GTC AGC CAG CTG ATA GCA GAC GTT GTT GGC 
val arg tHr his leu gin val ile ile ser val ser gin leu ile ala asp val val gly 

2612 

JIt^GGG GAA ACC AGA TTC CAG CAG TCC CTG TCC ATC ATC AAC AAC TGT GCC AAC AGT GAC 
ile gly glu thr arg phe gin gin ser leu ser ile ile asn asn cys ala asn ser asp 

2642 2672 

CGG CTT ATT AAG CAC ACC AGC TTC TCC TCT GAT GTG AAG GAC TTA ACC AAA AGG ATA CGC 
arg leu ile lys his thr ser phe ser ser asp val lys asp leu thr lys arg lie arg 

2702 2732 

ACG GTO CTA ATO GCC ACC GCC CAG ATG AAG GAG CAT GAG AAC GAC CCA GAG ATG CTG GTG 
thr val leu met ala thr ala gin met lys glu his glu asn asp pro glu met leu val 

2762 2792 

GAC CTC CAG TAC AGC CTG GCC AAA TCC TAT GCC AGC ACG CCC GAG CTC AGG AAG ACG TGG 
asp leu gin tyr ser leu ala lys ser tyr ala ser thr pro glu leu arg lys thr trp 

2852 1 XXXXXXXXXXXXX3CX Predicted 

CTC GAC AGC ATG GCC AGG ATC CAT GTC AAA AAT GGC GAT CTC TCA GAG GCA GCA ATG TGC 
leu asp ser met ala arg ile his val lys asn gly asp leu ser glu ala ala met oys 

Transmeiribrane Domain xxMaaaaDDaaaaaixxxxxxxxxxx | 

St GTC CAC OTA ACA GCC CTA GTG GCA GAA TAT CTC ACA CGG AAA GGC GTG TTT AGA CAA 

tyr val his val thr ala leu val ala glu tyr leu thr arg lys gly val phe arg gin 

2972 

^%GC ACC GCC TTC AGG GTC ATT ACC CCA AAC ATC GAC GAG GAG GCC TCC ATG ATG GAA 
gly cys thr ala phe arg val ile thr pro asn ile asp glu glu ala ser met met glu 
ref 8 -in 

|^C%TG GGG^ATG CAG GAT GTC CAT TTC AAC llf GAT GTG CTG ATG GAG CTC CTT GAG CAG 
asp val gly met gin asp val his phe asn glu asp val leu met glu leu leu glu gin 

TG^ GCA GAT GGA CTC TGG AAA GCC GAG CGC TAC GAG CTC ATC GCC GAC ATC TAC AAA CTT 
l^s Si IbI gly leu trp lys ala glu arg tyr glu leu ile ala asp ile tyr lys leu 
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n ref 9.1 

3122 JL ^^^^ 

ATC ATC CCC ATT TAT GAG AAG CGd^AGG GAT TTC TTT GAA GAT GAA GAT GGA AAG GAG TAT 
ile ile pro ile tyr glu lys arg arg asp phe phe glu asp glu asp gly lys glu tyr 

3182 3212 

ATT TAG AAG GAA CCC AAA CTC ACA CCG CTG TCG GAA ATT TCT CAG AGA CTC CTT AAA CTG 
ile tyr lys glu pro lys leu thr pro leu ser glu ile ser gin arg leu leu lys leu 

ref 10 

3242 3272 i} 

TAC TCG GAT AAA TTT GGT TCT GAA AAT GTC AAA ATG ATA CAG GAT TCT GGC AAG GTC AAC 

tyr ser asp lys phe gly ser glu asn val lys met ile gin asp ser gly lys val asn 

3332 

S^AAG GAT CTG GAT TCT AAG TAT GCA TAC ATC CAG GTG ACT CAC GTC ATC CCC TTC TTT 
pro lys asp leu asp ser lys tyr ala tyr ile gin val thr his val ile pro phe phe 

3392 

I^C^GAA AAA GAG TTG CAA GAA AGG AAA ACA GAG TTT GAG AGA TCC CAC AAC ATC CGC CGC 
asp glu lys glu leu gin glu arg lys thr glu phe glu arg ser his asn ile arg arg 

3422 3452 

TTC ATG TTT GAG ATG CCA TTT ACG CAG ACC GGG AAG AGG CAG GGC GGG GTG GAA GAG CAG 
phe met phe glu met pro phe thr gin thr gly lys arg gin gly gly val glu glu gin 
n re£ 11.1 

3482 JL 3512 

TGC AAA CGG CG^ ACC ATC CTG ACA GCC ATA CAC TGC TTC CCT TAT GTG AAG AAG CGC ATC 
cys lys arg arg thr ile leu thr ala ile his cys phe pro tyr val lys lys arg ile 

3572 |«3cxxxxxx Coiled-coil 1 xxxxxx 

CCT GTC ATG TAC CAG CAC CAC ACT GAC CTG AAC CCC ATC GAG GTG GCC ATT GAC GAG ATG 
pro val met tyr gin his his thr asp leu asn pro ile glu val ala ile asp glu met 

3602 jQaaoQCXX Coiled coil 1 cont'd xxmc 3632 xxx3txxxxxxxxxxxxxxxxxxxxaixxx»Qaai 
ACT AAG AAG GTG GCG GAG CTC CGG CAG CTG TGC TCC TCG GCC GAG GTG GAC ATG ATC AAA 
ser lys lys val ala glu leu arg gin leu cys ser ser ala glu val asp met lie lys 

n ref 12.1 

3662 xxxxxxxxxxxxjtxiatxxxxx 1 3692 M 

CTG CAG CTC AAA CTC CAG GGC AGC GTG AGT GTT CAG NStC AAT OCT GGC CCA CTA GCA TAT 

leu gin leu lys leu gin gly ser val ser val gin val asn ala gly pro leu ala tyr 

3722 3752 

GCG CGA GCT TTC TTA GAT GAT ACA AAC ACA AAG CGA TAT CCT GAC AAT AAA GTG AAG CTG 
ala arg ala phe leu asp asp thr asn thr lys arg tyr pro asp asn lys val lys leu 

3812 IjcxxxMoacxxxxxxxxxx 
CTT AAG GAA GTT TTC AGG CAA TTT GTG GAA GCT TGC GGT CAA GCC TTA GCG GTA AAC GAA 
leu lys glu val phe arg gin phe val glu ala cys gly gin ala leu ala val asn glu 

3842 xxxxxxx Coiled coil 2 xxxxxxxxxx 3872 xxxxxxxxxxx»cxxxxkxxxxxxxxxxxm« 
Jlt c^^AAA 6AA GAC CAG CTC GAG TAT CAG GAA GAA ATG AAA GCC AAC TAC AGG GAA 
arg leu ile lys glu asp gin leu glu tyr gin glu glu met lys ala asn tyr arg glu 

3902 xKxxxxxxxxxxxMcicaacxacxxxxxxxxxxxatx 3932 X3at\ 
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ATG GCG AAG GAG CTT TCT GAA ATC ATG CAT OAQ CAG ATC TGC CCC CTG GAG GAG AAG ACG 
met ala lys glu leu ser glu ile met his gXu gin ile cys pro. leu glu glu lys tlir 

3992 

Itc^GTC TTA CCG AAT TCC CTT CAC ATC TTC AAC GCC ATC AGT GGG ACT CCA ACA AGC ACA 
ser val leu pro asn ser leu his ile phe asn ala ile ser gly thr pro thr ser tHr 

4022 Ixxxx PBM xxxxx| 

ATG GTT CAC GGG ATG ACC AGC TCG TCT TCG GTC GTG TGA TTA CAT CTC ATG GCC CGT GTG 
met val his gly met thr ser ser ser ser val val STP 

4082 
TGG 



GGA CTT GCT TTG TCA TTT GCA AAC TCA GGA TGC TTT CCA AAG CCA ATC ACT GGG GAG 



4142 "72 
ACC 



'gag CAC AGG gag GAC CAA GGG GAA GGG GAG AGA AAG GAA ATA AAG AAC AAC GTT ATT 



4202 

TCT TAA CAG ACT TTC TAT AGG AGT TGT AAG AAG GTG CAC ATA TTT TTT TAA ATC TCA CTG 



4262 4292 
GCA 



ATA TTC AAA GTT TTC ATT GTG TCT TAA CAA AGG TGT GGT AGA CAC TCT TGA GCT GGA 



4322 '*352 

CTT AGA TTT TAT TCT TCC TTG CAG AGT AGT GTT AGA ATA GAT GGC CTA CAG AAA AAA AAG 



4382 



GTT CTG GGA TCT ACA TGG CAG GGA GGG CTG CAC TGA CAT TGA TGC CTG GGG GAC CTT TTG 



ref 13.1 



4442 Jl ^^'^^ 
CCT CGa\:TC GTG CCG GAA ATC TGA TCG TAA TCA GGG TAC AGA ACT TAC TAG TTT TGT CTA 



4502 4532 
GGA 



GTA TGT TGT ATG ACT AGG ATT TGT GCT ATT ATC TCA TTC AAC AAC ATA GAG CAA GAA 



4562 ^592 

TAG TGA GCT AAC TGA GCT AGA CAC TCA ATT AAT CCG CTA CTG GCT TCA AGT CAG AAC TTT 

n ref 14.1 

4622 4^52 JL 

GTC ATT AAT CAT CGA CTC CGG GAC GGT CAT ATA TGT A&T ACA TTT CTA CAT TTT TAA TAC 
4682 4712 

TCA CAT GGG CTT ATG CAT TAA GTT TAA TTG TGA TAA ATT TGT GCT GGT CCA GTA TAT GCA 
4742 4.712 

ATA CAC TTT AAT GGT TTA TTC TTG TCA TAA AAA TGT GCA ATA TGG AGA TGT ATA CAA GTC 

4802 
TTT ACT 
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BAC sequences of Human CLASP 2 
Ref 11 

Sequence of BAC4 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 

sequence is underlined and represents nucleotides 356-375. ^^n^^^r^r^r^ 

^rTACAGNOTNTACTCAGG TATGTGCTCCTTCAACAAAATTAGCAGTTGCTGCTCT 

GTGACAAAGTTTGCACCATTTTGCAAGAAGAAAAAAATCCTAATGTGTTATATTAX:^^^ 

TATTTTTACTCTATAGATCTTTTTCTAAAGAAAGAAAGTACAACTGAAGTGCTTATAT 

GTATTCATATAAATGACTAGTACAAGCATCATTTTGCAACAGATTTCCCCTTTCATTG 

GAGGATCTTCTTGATGTTATTTGTACACGATCAATTTTTAGTCTTAATAAGATGAGGC 

TGGGTGTGGTGGCTCACACCTGTAATCCTAGCATTTTGGAGGCCAAGGTGGGCAGAT 

CACITTAGCCCAGGGGTTTGAGACCAGCCTGGCCAACATGGCAAAACCTTGT^^^^ 

CAAAAATACNAAAATTATCCAGGCATGGTGATGTGTGCCTGTAGTCCCAACTNCCTAG 

GAGGCTAGGGGTAGGGGGATTTGCAAGAGGCTGGGAGGGTCAAAGCeCNAANTGAG 

CCATTGGTNCATGTCACTTGGACCCCAAGCNNGGGGNGANCAAGAGCAAAGGACTNN 

TGTNNTTTANAAAAAAAACCGGGCTACCATACNNACCAACCCNCNNAC 

TTTCCAIWTTAAAANAAGGCTTTGNCTTGCANAGGAAAANCAAAAlTSfNCC 

Ref 1 2 

Sequence of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is underlined and represents nucleotides 351-375. ^^t^^t 
TrTGGTTTCTArAGTGTATACTNAGCT ATCTGCTC^I^^^ 

TACTATA?rrTTACTCTATAGATCTTTTTCTAAAGAAAGAAAGTACAACTGAAG^ 
I^ATGTATTCATATAAATGACTAGTACAAGCATCAXm 

Ittggaggatcttcttgatgttatttgtacacgatcaa^^^ 

AGGCTGGGTGTGGTGGCTCACACCTGTAATCCTAGCATTTTGG^^ 

agatcactttagcccaggggtttgagaccagcctggccaacatggcaaaaccttgtc 

TCTACAAAAATACAAAAATTATCCAGGCATGGTGATGTGTGCCTGTAGTCCC^^ 

^SggI^Igg'^ggggattgcaagaggctngg^^^^^^^ 

GCCATGGTCATGTCACTGCACCCCCAGCCAGGGCCGACAGGA^^^ 

caaaaaaaaacagnaaccaacanccaacaacaacaacnacctttcngcaaaanaagc 

TTGCTNCAANGAAACCAAAATGNCTTCTTNTTTTCCCCCN 

t^u^nce of BAC26 using primer HC2AS2, which spans nucleotides 327-346 of the cDNA. Exon 
sequence is not found within this sequence. This sequence most likely represent mtron sequence 



A^^IS^TCCxllGTGTNATATCACTATATr^T^^^^^^ 

?I??actctoaI?IIgS?g^gctg^^ 

GC^GTCGC^A^C^ANCCNCGGTNGAGACAGCT^^ 
?i^2^^CTCCGCNGGNGATGGGCTGAGC^^^ 
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GNNGAGNTGTNGCACGNCGAGNGAGCATGNTCTGTACTGACTCATCAGGCGNCNACA 
CGNTCTGTTCNAAAACATACCACACACACTCNCACCTNCGCAAAATTGCTCTNNAAAN 

a?2cttntttca5acngntncaat^^ 
^atc^SctgcSaaSatncgnccacctn^ 

TTATAGCTCCCCTACACNTNNCAGC^ 

SJItIctctctccgta^^^^ 

CNNTTNANCNGTOCGCTATCTCTNNN^ 

aSSct^tnScSScactantcacaacttntncntnnaa^^ 
ctaccaSctcacttactacctnttcacncantctccttcnctntccactgatctccaca 

TAGCTGCTNTACTCGCCANTTTATCATATNCACACNCTCTACGCTNNNTNT 
SelueLce of BAC4 using primer HC2S1, which spans nucleotides 1107-1126 of the cDNA. Exon 



AAGTTTATCCi 
TNTAGTCNCN 

Ref3.1 



"Ref 3 1 

Sequence of BAC4 using primer C96 AS, which spans nucleotides 1443-1452 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1370-1422. ^^^^ _^ . ^„ „™™>^ 
AATCAGCAGACCAAACAGAGG^ 

AAAAGAAGACACACATGGTAAGTTTGACCCAGGATTCTGAGAACCGAACTAAGTTGG 
^^GtcCATCTCm 

J^^agIgcIS^Vcttagtgctgttaccatga^^ 

A^lcCACTTAAlArAAAGGACATTATC^^^ 
CCCTTGAAGCA^iTCTGTCATCAAAAGAATGCTTrATCAG 

?iI?AS?C^CACC:^G^^^ 

???G^^f^A^^^^^^^ 

^II^gX^^^^^^^^^^^^ 
TAANTGAAAACCAATTTAAATCCAATTNAGGNG 
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Ref 4.1 

Sequence of BAC4 using primer C2AS5, whicli spans nucleotides 1716-1735 of the cDNA. Exon 
sequence is underlined and represents nucleotides 1602-1703. 

TTCCTTTCTGCAAGGCTOTTCCCGAATrTnTGCTTATGAGAGATCCTCTCGAATCA GC 

ATTTCTCACACTGTTGATGTTTGGAGTTGAGGTTGTATA TGGAGAAGCTAAATGGAAA 

TCAAGCCAACAATAAAGTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACTT 

TAAGGGAAATTGCTTTTATTGCACTTATTTTTTCTGTTAGGAAGTTGGCTCAAGAGTT 

GCATTCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAAC 

TAGTCTGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATCG 

CTAAAGACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAATT 

AAAAGATAAATCCACCTACAACTATCAAATCACAAAATTAAACCACACAACAAACTTG 

TAGCATTCAAACTGGTAATAAACACTGAGGAGCCTACCCAACTCTGAGGGGTGTCAT 

GGGGTATTTTAAATTTTCGAGGAGAACACAGTGATATGTGACCTCAGCCAGAAGCTG 

CTGTTTNAGCAGCAGGTTGGTGCTATGCTCCTTTTTGAAGACATATTTGTGAAGCTGG 

GTATTTTGGGGGGCCTGCTTATGATAAAANGGCAAGGTNTTCAATGNAGGGGN 



Ref 4.2 

Sequence of BAC26 using primer C2AS5, which spans nucleotides 1716-1735 of the cDNA. Exon 



sequence is underlined and represents nucleotides 1602-1703. 
TTCCTTTCTGGAAGGCTGTTACCCGAATCTGTGCTTATGAGA GATCCTCTCGAATCAG 

CATTTCTCACACTGTTGATGTTTGGAGTTGAGGTTGTATATGGAGAAGCT AAATGGAA 

ATCAAGCCAACAATAAAGTTTTATTAAGACAGAACAAAATAAAGATGAGTACTGAACT 

TTAAGGGAAATTGCTTTTATTGCACTTATTTTTTCTGTTAGGAAGTTGGCTCAAGAGT 

„ „ . . r™, 4 ^r^Tj-i A yiz-t'TTT A A A A A*-«*-> Am/riTr" ATAXAfA ATfJAnATAAAAAGAAA 



TTAAGGGAAATTGCTTl lAl IGCAt-l lAi iif\\jvjr.rt_H.vjx ivrvjv-i.'^rv^vjr^^ ^ 

TGCATTCCATTACTTCACCTTTAAAGAACCAGGTCATATACAATGAGATAAAAAGAAA 
CTAGTCTGAAACATTCAGATGTAAACATCAATTCACTTGTTAGAAACCACCTTTGATC 
GCTAAAGACTAAATGCATACCTGTTTCAGAATGTGATAGAATGAAGACTTAAAAAAAT 
TAAAAGATAAATCCACCTACAACTATCAAATCACAAAATTAAACCNCACAACAAACTT 
GTAGCATTCAAACTGGTAATAAAACACTGAGGAGCCTACCCAACTTTGAGGGGTGTC 
AATGGGGTNTTTTTAAATTTTTCGNGGGANANCCCAGTGNTATGGTGACCTTCACCCA 
AGAAGCTTGTTTGTTTNACCAAGCNAGGTTGNNCTNTGCTCCTTTTTAGAANACNNTA 



Ref 5.1 ^^^^ 
Sequence of BAC4 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 



sequence is underlined and represents nucleotides 1724-1736. 
TTCCTGGATAAGGTAATTGCTTTTACCCAACACAAATGTTTCTTATAATCAATGGATT 

TAGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGTGACCTTTTGTTTT 

TTTCTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTAAGGTTGGGGCCC 

ATTCCTTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTGGGCAGGGTTTGC 

AGTGGAGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGAGGGACTCAAAAT 

GCTGCCCATCTCAATTTCCTGTAGTCTTTTTATTTATTTATTTATTTTTTGAGACAGAG 

TCTCGCTCTGTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAATTNACTGCAACCT 

CCGCCTCC:TGGGTTCAAACGACTCCTCTGCCTCAGCCTCCCCAGCAGC:TGGGACCA 

CAGGCACAAGCCACCACCGCCCGGCTAATTTTTTGTNTTTTTAGTA-.GAGAT.GGGGT^ 

TCACCATATTTGGCCAGGCTGGGCTCAAACTCCTGACC-.TCGTCATCCGCNCCCTCGG 

NCTNCCAAAGTGCTTGGGATTNCAGGCNGTGAGCCCACTTACACCTNGGGCAATTCC 

CTCTNAGTCTTTTTTACCAGAGACACCATCATTCAACACAGCTTTTCCACCCACAA 
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Ref5.2 

Sequence of BAC26 using primer C2S6, which spans nucleotides 1686-1705 of the cDNA. Exon 

sequence is underlined and represents nucleotides 1712-1736. 
TnAGAAGAGrAATTTrCTGGATAAG GTAATTGCTTTTACCCAACACAAATGTTTCTTA 

TAATCAATGGATTTAGCCCAAAGTAAACGTACTTCATGTTCTAGTGCCTTTTAAGTGT 

_ . ^™,^r,^r^-^rw^mrr.y-.rr. A » A A /"^/"'r-r-' i^Tn A rT'xnr; A fiT A OfiTG ATG AG AGCTTTA 



TAATC AATGGATTTAGCCC AAAG l AAALJij i Ai^ 1 1 »- a i 1 1 v. x ^.v. j. x x x x x * 
GACCTTTTGTTTTTTTCTAAACCACCCGGCTGACCTGGAGTAGGTGATGAGAGCTTTA 
AGGTTGGGGCCCATTCCTTGAAGTGCTCTGATTCCTGTTTCCAGTACCTCAGATCCTG 
GGCAGGGTTTGCAGTGGAGCGTCTTGAGTGAATGGCTCTGGTGGGTTGAACGGGGA 
GGGACTCAAAATGCTGCCCATCTCAATTTCCTGTAGTCTTTTTATTTATTTATTTATTT 
TTTGAGACAGAGTCTCGCTCTGTCGCCCAGGCTGGAGTACAGCGGCACGATCTCAAT 
TCACTGCAACCTCCGNCTCCCTGGGTTCAAACGACTCCTCTGNCTNAGNCTCCC:AGC 
AGCCTGGGAACCACAGGCTCANGCCACCACGCCCGGCTAATTNTTGTAATTTTNAGT 
AANAAATTGGGGGTTCTCACCATNTTGGCCCAAGNCTTGGGCCTAAAAACCTTNCTNA 
CCNTCGNCATTCNCNCCCCNACCNTGGGCNCTNCTCAAANGNGCTTGGGGATTTANC 
ANNGGCNTTAACCCCCCNTATCACCGTGGNCCTTAATTT 



ScMuence of BAC4 using primer C2S7, which spans nucleotides 1918-1937 of the cDNA. Exon 
sequence is not found within this sequence.Since the primer is directed against exon sequence we 



CTACAGAGACCAGCCAAGTTTGACAriAGAHJA«^i^iiiA«jr\^AAxv,ii^ iv,v.^*«ur..x 
GTTTCTTGCTAAAACAGGTTAAGCCTTGCAGCCACTTTATCTGTAACTGGCNGAGGTT 



TTGACATAAAA 



Ref 7 1 

Sequence of BAC4 using primer C2S8, which spans nucleotides 2143-2162 of the cDNA. Exon 

sequence is underlined and represents nucleotides 2182-2219. ^r^rr^^r-^r-T^ 
CTCTCGACArGCTGTTTCTATTAACATTGGrGTTTAAG GTTTGTATCAATTTGCTGTT 

CGNGGTTCTAGTTTTACCTTTCACATTCATTCTGCTTGGTAAGCTCAGTGAGO^^^^^ 

CTTACTATGTTGCATTTTTACTTCAGCAATTATTTTTGTCCCTGTAAGGAAACCAT^ 

TCTTTAAATTCCTTTAATGAAATCATTCCACAGTGAATGGCTTGAATGCCCTGAAATA 

AAATTTAACTGGTCAGTGTGTGCTGCGCGCTTGGGTATGGTGGAAACACGGTCTCTG 

GAGGCAGTTAACTCTTGGCTCGAACCTTGAGGATGGTGAATATAGGCACCTAATCAG 
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GCATTTCTGCCTTGAATATCTTTAAATATATCCAAATGTTATAGCGTTTAATTAGATTT 

WGTAGAAAGGAGCAATAAACACAAGACACATGTTITCAGTTTmATCT^ 

^ATTAiATGATAAAAACGlTTTGGAGATAGAAAATGAAAGGGGTTTTTTTTTTGTCT 

TGTTTTAAAGTITTAGCAAATAATATTCAAGTAGGTGGAGATGGACTCTTCACCACTC 

TCCTGTTTTTAGGAACCCAATACTITTrCATTCTTGCTAAATGATTACTTCCATrTCTA 

GCATAGAAAAGGAGAAAATTGGAATGAGTGTTTATAT 

Ref 8 1 

Sequence of BAC4 using primer C2S9, which spans nucleotides 2992-3011 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 



oresume that sequence derived from C2S9 is intron sequence 
CGCmNlA^TNCCAGC 

GTGATGCCTGGCTCTGATTGTGTGGGATTGGTCATCAGTGGCGGTTGGCAGNTGGGC 
TTCATGGAAGCGGCCATGGGGACTGATGGCAGGCCCTTGGATTGCCACCGCAGAGCC 
TGGCAGTGTCTTTGGTCTGCATTCCTACCGGCGAAGTCTCATTTCACCTCACGTGTTA 
TCTCTTGGAAAGCATTCCTTTAGCGGGCTGTGTCTACCCTTCCATCCTCTCGTCCAAA 
rTCCCCCTCCTTCTCTGTTCTGTCTCCTTCCCATCCTCTTCTCCCCAGTTCTTCTTCCT 

atgtSccttcctcagtggtttctcttcctctgtttgactttccaaggtcattttgactg 

TTCCTGCTCCCAACTACAAAGATACTAAAATCTCACCTAACCACTCTTCITCmC^^^ 

atgaaagaatgttttcagtccatcccaaatttgtgtggacttcacaaaccttctctaa 

AATGGAGCCTTTTCTCTTCCTACTCTTGACTAGNTGGTAAACGCTCCATGTTCTTGGC 

cagaactccctggtgagtagcgtcactcccactttcctgtgcagaaccaagcctcct 

AGAAAACTCCTTTGCANCTGAGTGGGTTGGGACACGCCCTTTNTTTGGG 
Ref 9 1 

Sequence of BAC4 using primer C2AS10, which spans nucleotides 3276-3295 of the cDNA. Exon 

sequence is underlined and represents nucleotides 3147-3234. ^^^^ , ^ ^ .-.^^i-T 

-rfTANACCNATNTATCCGNG T r AGTTANAGGAGTCTCTGAGA A ATTTrCGACAGCGGT 
, ^^r^. . ArrArrA/-^T'i-<-^Tr'i-rr'f^ATrf"TTr'ATrTTrAAAGAATCCCl 



GTGACATAAAGCACAATTAGAGCTATCCCTGAACGTAAGCCCAGGGCTTACCACCTA 
GGAAGCGTTCTTTTATTACAAGGGGGAAAAAAAGGAATGGGTCTAAAAATCCAGCTC, 
AAATGGGCTTTCTGAATGAGAAAGAAAATGCTAATAACATGAAGTCTAGGTGCAAAG 

StLu^ggaaaaacacaacattgcaaacttattcaagaatgcagtc^^^ 

TGAAATGAAAGATTTTGGATACAAGACTAAGCTGTCCCAGGGAAGTCTAA^ 

caagcctgtttcactttcccaagaagcagaactcactanaaaatgatgagcagccca 

CGACAGGCAGGCTCAGAAGTGGACATGCCTCCCTTCTCCTGATGGCTNCCAT^ 

SGGATTTrATGGCATGAACTGAAGCGTTTGGGGGTCTGGAGTAAGTTTA^^ 

TTAGGTAAAGCTTGTATAAATTGTATTTTTGCTTTACCCGATGAGAAAAAAAATATTN 

AAGACCTGGTAGCTTCAATATTCAAGAAAAATATTTTTCATNTCACCCG 
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ReflO.l 

Sequence of BAC4 using primer C2S11, which spans nucleotides 3167-3186 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3231-3296. 

^q.-^ A ^nTnn a fZi-nNnn a Mf-r- A nr^d A r A A rrXN A A rrTNCTTAAACTGTACTCGGATN 

AATTTGGTTCTGAAAATGTCAAAATGATACAGGATTCTGGCAAG GTATTGACCATGTT 

TGGANAAGTTTCATAGCAATGTAATGTTGTGATNCGATTACATATNATATATTTTTAA 

ATGTNTATAGAAAAAAACACANGAAAAATATTAAGGATTGTTGGCCCGTGAGTGGCA 

GGTGTATNTTCTTNCTGATCCTTTAGNGCTTTCCATTACATGCNTGACATTAAAAAAA 

NCTTTATCGCCTAATTTTTGAAACATCTAATTTTACAAAATAATTAACCGTNTGGCCAN 

GNATATTNTCATTTTTAGGNCCAGCTATTTAGAAACTCTGACANAAATGAGGGGCTGT 

GGCTTNCCTNCCTNNACTTGNCCCTCTTTCNNGNATGTACCACATGAACTTGNCNCCT 

CTTTCNNCTNACCGGGTGGCATGTTANAGGACAGGTTGAAACCNCANTNGGGCNGGA 

NTTNGGTNNAATTGGGACACAATGGTACNANGCTCTATNGGAATNGAAACTCTCCCN 

ACNNNCNGTGNNCCNTGGGGAAAATGNGNCNNATTCATTTTN 

Refll.l 

Sequence of BAC4 using primer C2S12, which spans nucleotides 3474-3493 of the cDNA. Exon 
sequence is not found within this sequence. Since the primer is directed against exon sequence we 
presume that sequence derived from C2S9 is intron sequence 

AGNANNGTTNNGCAGCTGCANNTCTGGACCCANAGGCCGCANGGGCACGAGCCNGG 

ACACGCTCGGCAAAGAGCTGTCCAGAGGGATTCAGAAGCTTCAGGACTGGAAGGGTC 

TTTCGAGCTCAGTTAGCCACCCCCACACCCATTTCAGTTTCACATTTATCTAGTGCTT 

CCTTTTGAATACTTGGGATGTTTTTCTGTTGATCTGTTGGCACTTCCTTCTTCCACAA 

GACCAGAAGCTCATATCCAATCTAAGGTCACTTACCCTTCTGAGAATCTGATGAAAAT 

GGCGTGCCTTATGTGCCTAGATGCTTTTGCACACAGTCTAAGGTGACTTATGGACTCC 

AGGTCCAGCAGCCACACCCAGTCCTGGGTCTCCGCACAGGGAGGGACCCGTCTTCAC 

ACACCTGTCTCAGGTTCTAGCATTGGGCTGCTTCAGCGGTCTCAGGCTGTGAGTAAA 

TGGGATGTGAGCTTGGATCGCCCCACGCTGTTGNCCCCCGGGGGGCTTGGCCAGCTG 

GCCACTTNGAAATGCCTCCTTTTGCCCAGGAAAGCTCACTGCATTTCAATGGGGNTTN 

TCCACGAAGTTCANCTTTANGGG 
Ref 12.1 

Sequence of BAC4 using primer C2S13, which spans nucleotides 3645-3664 of the cDNA. Exon 
sequence is underlined and represents nucleotides 3683-3699. 

A^:NAA^;r,T^fN^TrANTNAANN CAGCGTGAGNGTTCAGGT GAGCCAGGCACAGCAGGC 

CGGAGGGCAGCAGGGGACGTCCTTGCCCCTGGGTGACTTGAGAGTCGTTTCCACTAA 

CAAGGTCTACTTGAGAGCCTCGGTTTACCAAGTGATCCCTGCTCCCTTCCCCCAACGT 

NTGTGACATTTCTCCTGATATCAGAGGGGGAGGAAACCTCATGATCCCTGCCCCCCG 

CCCCATGAGGACTGACTGTGGGGACAAAGAGCCAGATCTCATAGACTACCCTGATTT 

GTCAGTATTTGGGGAATTCTGGGTGCCTGATTAGAAGCATCAAGACTCTTCTAAATNC 

AAAGAAGTGTGGAGAGCAGTAGATTTTCCTATAAAACTGGTGTTGCTGGTTTCTATGA 

AAATTGGATCCAAAAAAAGTCCTTAAGTTTACCCTCTTAATGGNATCTTTTGATTAAT 

GGAATTCATTATTTTAATATAGCCCAATCAATCCAATTTTTCTTTATTGGTAGCATTTT 

TATGTTCTCTTTAAAAAAATCTTGGNCTACCTCCAAAATTTCACAGATGTTCTCCTAG 

GGTTTTCCTCCTTTTGGTTCAAGCATCCCATTCAANGTCTTGCAGTCCATTCTGGGG 
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IVvl. x«J«X 

Sequence of BAC4 using primer C2S14, which spans nucleotides 4289r4308 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4321-4448. 

GACTTANATTTATTCTTCCTTGCAOAOTAGTGTTAGAATA CATGGCCTACAGAAAAAA 

AAGGTTCTGGGATCTACATGGC AGGO A GGGCTCr ACTGACATTGATGCCTGGGGGAC 

CTTTTGCCTCGAG GCTGAGCTGGAAAATCTTGAAAATATTTTTTTTTTCCTGTGGCAC 

ATTCAGGTTGAATACAAGAACTATTTTTGTGACTATGTTTTTGATGACCTAAGGGAAC 

TGACCATTGTAATTTTTGTACCANTGAACCANGAGATTTAAGTGCTTTTATATTCATTT 

CCTTGCATTTAAGAAAATATGAAAGCTTAAGGAATTATGTGAGCTTAAAACTAGTCAA 

GCANTTTAGAACCAAAGGCCTATNTTNATAACCGCAACTATGCTNAAAAGNACAAAGT 

AGTACAGNATATTGNTATGTACATATCATTTGGTAATACACNCCNGGCNTTCTGTACA 

TATATGTATTACATTTCTACNTTTTTAATACTCCCNTGGGCTTATGCCNTTAAGGTTAA 

NTTGNGATAAATTTNGGCTGTTCCNGTNTATNCNATACNCTTTT 



Refl4.1 „ 

Sequence of BAC4 using primer C2AS15, which spans nucleotides 4680-4700 of the cDNA. Exon 
sequence is underlined and represents nucleotides 4660-4683. 

ATGAGAATGTAAT ACATATATGTACAGAATGCCAGGACTGTATTAACAATGATATGTA 

CATAACAATATACTGTACTACTTTGTACTTTTCAGCATAGTTGCGGTTATTAATATAG 

GCCTTTGGTTCTAAACTGCTTGACTAGTTTTAAGCTCACATAATTCCTTAAGCTTTCAT 

ATTTTCTTAAATGCAAGGAAATGAATATAAAAGCACTAAATCTCCTGGTTCACTGGTA 

CAAAAATTACAATGGTCAGTTCCCTTAGGTCATCAAAAACTAGTCACAAAAATAGTTC 

TTGTATTCAACCTGAATGTGCCACAGGAAAAAAAAAATATTTTCAAGATTTTCCAGCT 

CAGCCTCGAGGCAAAAGGCCCCCAGGCATCAATGTCAGNGCAGCCCTCCTGCCATGT 

AATAAAATCTAAAGNCCAGCTCAAGAGTGCTACCACACCTTTGTTAAGACACAATGAA 
AACTTTGGATATTGGCAGGNGAGATTTAAAAAAAAATGTGCCCTTTCTTACCACTCCT 
ATAGNAAAGTCTGGTTAAGAAATAACCGTTGGTCTTTATTTTCCTTTTNTTTCCCCTTC 

CCTTGGGNCTTCCTGGGGCTCGG 
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TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLraEEIAI^WWCSG--SVR E 



Cadherin 



KLLRYSWFFFDVIilKSMAQHLIENSKVKLI 
KLLKYSWFFFDVIilKSMAQHLIENSKVKLI 


savac 

rnqfI 

RNQP 


je 

FPASYHHAAETWNMLMPHITQKFGD 
FPAS YHHAVE TWNMLiMPHI TQKFRD 


KLLKYSWFFFEIIAKSMATYLI£ENKIKL1 
HVLKbSWFFFAI ILKSMAQHLXDTNKIQLI 
SALQQAWFFFEIihlj^SMVHHLYFNDKIiEAI 


HGQP 
RPQP 
RKSP 


FPKAYHHALHSLFLAIT- IVESQYAE 
FPESYQNEIiDNIiVMSn^SDHVTWKYKD 
FPERFMDDIAAIiVSTIASDIVSRFQK 



Ref . 



6.1 

1.2/1.2/2.1/2 



NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS---CFAPGD^TLFEYKFEH^ 2 . 1 

npeasioJanhsiavfikrcfte^^ 

TPKESRN^mYSLASHi^ n i 

i^T^ATHSVARFX^^ NYIS— MFSSGDLKTLCQYKFDFL 7.1 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



RWCNHEHYIPI^PM— -PFGKGRIQR $^nT ""d^Il^If 

RWCNHEHYIPIOTliPM PFGKGRIQR"-------~"YQDLQL-^ 

Qiiciiil^HYi^i^iii^""-^^^ 
RIICSHEHYVTI^ 



4.1/4.2 



3.1 



8.1 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



TrrT.T.TENVORINVRDVSPFPVNAG-MTVKDESLALPAWPL^^ 

lylplfgS^^ 



LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG ^'^^^"^""^S!!:! 

LWLYSSlLDNMiRIYLKDLYPFT^ 

LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMX£» 

LYLPIiVGIILDALPQLCDFTVADTRRYR TSGSDEEQE- 



GA GAIT 



KDLLGAI 



SGIASPYTTSTPNXNSVRNADSRGSLISTDSGNSLPERNSEKSNSLDB 



IQQSS 



9.1 



5.1/5.2 



TnT^TAYGSFONG--------- HGIKREDSRGSLIP-EGATGFPDQGNTGEN TRQS 

QW^IA^S™- LKTSG-IVLsiLPYKQ™ 2 . 1 



10.1 
3.1 
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HC2A TLGNSVVRCDKIJ)QSEIKSIJMCFLYILKSMSDDALFTYVm-KASTSEU4DFFTISEV 

KIAA TLi 
rat 



Ref. 



GNSVVRCDKLDQSEIKSLmCFLYILKSMSDDALFTYWN-KASTSEIMDFFTISEVCL 



HC4 STRSSVSQYNRII3QYEIRSLmCYLYIVKMISEDTIiTYWN-KVSPQELINII.IIXEVCL 

HCl ALIGSTLRFDRLDQAETRSLIMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSIljDVCL 11.1/11.2 

HC3 TFSAESSRSLLICLLWVLKN-ADETVLQKWFTDLSVLQLNRLIJaLLYLCV 

HC5 MIJ?ADTTROTiaCFLWIMKN-M)QSI.IRK»n:ADLPSTQLNRILDLLFICV 

HC2A HQ512YMGKRYIABNQE6LG-PIVHDRKS '^'^^^l^l^ ^'^ 

KIAA HQFQYMGKRYIAR Z—ZZTl 



HC4 FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

HCl QNFRYLGKRNIIRKIAAAF— KFVQSTQNNGTUCGSNPSCQTSGLLAQWMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK— KSKDMRAK LEEAILGS IGARQEMV 

HC5 LCFEYKGKQSSDKVSTQVl^—KSRDVKAR LEEALLRGEGARGEMM 



HC2A HARLQQI, GSUDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

KIAA HARIQQL GSLDNS LTFiraSYGHSDADVLHQSLLEANIATEVC 

_x. — — — • ~" 

HC4 OARIQHL SSLESS FTLNHSSTTTEADIFHQALLEGNTATEVS 

HCl QHRSQTLPIIRGK NALSNPKL LQMLDNTMT^NSNEIDIVHHVDTEANIATEGC 12 1/12 . 2 

HC3 RRSRGQiiERSPSGSAFGSQENLRWRKDMTHMRQNTEKLDKSRAEIEHEALIDGNLATEAN 6.1/6.2 

HC5 KRRAP^RFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNIATEAH 

HC2A LTAIiDTLSLFTl- ..^ 

KIAA LTALDTLSLFTLAFKNQLIADHGIMPUIKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

KLSRGHSPLMKKVFDVYLCFLQKHQSEMALKNVFTALRSLIY 

HC4 LTVIJDTISFFTQCFKTHFLNNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFIS 

HCl LTILDLVSLFTQTHQRQLQQCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVCI 13.1 

HC3 LIIIJDTLEIVVQTVS--VTES--KESILGGVLKVLLHSMACNQSAVYLQHCFATQRALVS 
HC5 i.iiii)i^ENIIGASS--ALDC--KDSLLGGVLRVLVNSLNCDQSTTYLTHCFAjrLRALIA 3 . 1 



IAFK^JQL^Ja^HGHNPIMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 7 . 1 



KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 
KFPSTFYEGRADMCAALCYEXLKCCNSKLSSIRTEASQLLYFIMRNNFDYTGKKSFVRTH 
KFPSTFYEGRADMCASLCYEVLKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 
KFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLIMPNNFEYTKRKTFLRTH 
iCGSFCYEVLKCCNHRSRSTQTEASALLYLFMRKNFEFNKQKSXVRSH 



HC2A 
KIAA 
rat 
HC4 

HCl KFPSAFFQGPADLCGSFCYEVLKCCNHRSRSTQTliAaAi.i^ii^cixu«^ri:,= i.-.i^«*^»....- — • , « 

HC3 i^PELLFEEETEQCADLCLRLLRHCSSSIGTIRSHPSASLYLLMRQNFEIGN—NFARVK 7.1/7.2 

HC5 ra-GDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS— NFARVK 

HC2A LQVIISVSQLIADWGIGETRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

KIAA LQVIISVSQLIADWGIGGTRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

rat LQVIISLSQLIADWGIGGTRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVIM 

HC4 LQiiiAVSQLIADVALSGGSRFQESLFIINNFANSDRPMLARAFPAEVKDLT^lRTV^ 

HCl lSLiKAVSQLIAD-AGIGGSRFQHSLAITNNFANGDKQMKN^NFPAEVKDLTKRXRTVIM 14.1/"- 

HC3 MQVPMSLSSLVGTSQNFNEEFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILS 

HC5 MQVTMSLASLVGRAPDFNEEHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILY 
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Transmembrane Ref . 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



ATAQMKEHENDPEMLVDLQYSIAKSYASTPEIJ^KTWLDSMARIHVKNGE LSEAAMCYVHV 
ATAQMKEHENDPEMLVDLQYSIAKSYASTPELRKTWLDSMARIHVKNGrLSEAAMCYV^ 
ATAQMKEHENDPE^ILVDLQYSIJ^SYASTPELRKTWIlDSMARIHVKNGE LSEAAMCYVHV 
ATAQMKEHEKDPEMLIDLQYSIAKSYASTPELRKTWIJDSMAKIHVKNGEFSEAAMCYVHV 

ataqmkehekdpemlvdlqysiansyastpelrrtwlesmakiharngelse|aamcyihi 

DTVKMKEHQEDPEMLIDI2^YRIAKGYQTSPDLRI.TWLQNMAGKHSERS^HAEAAQCLVHS 
DTVKMREFQEDPEMIM5IMYRIAKSYQASPDX.RLTm.QNM^ 



SH3 



TAIiVAEYL TRKGV- 
TALVAEYL URKEA- 
TALVAEYL TRKEAD 



AALVAEFL: 
AALIAEYLi< 
AALVAEYL : 
AALVAEYL 3MLED- 



IRKKL- 
KRKGYV 
3MIiED- 



FRQGCTAFRVI TPN 
-VQWEPPLLPHSHSACXiRRSRGC^/FRQGCTAFRVI TPN 
LALQREPPVFPYSHTSCQRKSRGGJ^FRQGCTAFRVXTPN 

FPNGCSAFKKITPN 



KRKGYWKVEKICtTASLLSEDTHPCDSNSLLTTPSGGS^dFSMGWPAFLSXTPN 

RKiXPVGCVTFC(NISSN 

HS r LPVGSVSFQNI S SN 



16.1/16.2 



8.1/8.2 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

C5 



IDEEASMMEDVG^QD VHFNEDVLfcdELLEQCADGLWKAERYELIADIYKLIIPI 

IDEEASMMEDVGMQD VHFNEDVIMELLEQCADGLWKAERYELIADIYKLIIPI 

IDEEASMMEDVOIQD VHFNEDVLMELLEQCADGLWKAERLRAGLLTSINSSSP 

IDEEGAMKEDAGMMD VHYSEEVLLELLEQCVNGLWKAERYEIISEISKLIGPI 

IKEEGAAKEDSGMHD tPYNe|nILVEQLYMCGEFLWKSERYELIADVNKPIIAV 

VLEESAVSDDWSPDEEGICSGKYFTESGLVGLLEQAAASFSMAGMYEAVNEVYKVLIPI 
VLEESWSEDTLSPDEDGVCAGQYFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVIPI 



ITAM 



ITAM 



ITAM 



ITAM 



YENRREFENLTQ\ YRTLpGApTKljliEVMHTKKRLLG- 
FEKQRDFKKLSDI YYDI 
HEANRDAKKLST3 HGKLbEAlFSKI 



YEKR^UD , . 

YEKRRDFERLAHI YDTL fIRA iTSKV CEVMHSGRRIiGO^fE^V^FGQAAQYQFTDSETDVE 

SMKSGGTLETTHI YDTL ^RP^SKV CEVITR A AGSWDLLPGGLFGQ 

TFFRVAFYGQ 

itlRStYLKVJ^WNSEKRLFG 

: /hqIstgwermfg 

LEAHREFRKLTLTllSKLQbAEjDSIV «CDH — KRMFG 



-RY^RVAFYGQ 
-TlfFRVGFYG- 
TYFKVGFFG- 



8.1 



17.1/17.2 



9.1 



9.1 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



-FFEDEDGKEYIYKEPKLTPLSEISQRLLKLYSDKFGSENVKMIQDSGK^PKDLDSKYA 10 . 1 
GFFEDEDGKEYIYKEPKLTPLSEISQRLLKLYSDKFGSENVKMIQDSGKVNPKDLDSKYA 
GFFEDEDGKEYIYKEPKLTPLSEISQRLLKLYSDKFGSENVKMIQDSGKVNPKDLDSKFA 
SFFEEEDGKEYIYKEPKLTGLSEISLRLVKLYGEKFGTENVKIIQDSDKVNAKELDPKYA 

gffeeeegkeyiykepkltglseisqrllklyadkfgadnvkiiqdsnkvnpkdldpkya 

TKFGDLDEQEFVYKEPAITKIAEISHR^EGFYGERFGEDVVEVIKDSNPVDKCKLDPNKA 10 . 1/10 . 2 
SKFGDLDEQEFVYKEPAITKLPEISHRLEAFYGQCFGAEFVEVIKDSTPVDK|rKLDPNKA 



4.1 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



YIQVTHVIPFFDEKELQERKTEFERSHNIRRFMFEMPFTQTGKRQGGVEEQCKRR|cILTA 11.1/11.2 

YIQVTHVIPFFDEKELQERKTEFERSHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTA 

YIQVTHVTPFFDEKELQERKTEFERCHNIRRFMFEMPFTQTGKRQGGVEEQCKRRTILTA 

HIQVTYVKPYFDDKELTERKTEFERNHNISRFVFEAPYTLSGKKQGCIEEQCKRRTILTT 

YIQVTYVTPFFEEKEIEDRKTDFEMHHNINRFVFETPFTLSGKKHGGVAEQCKRRTILTP 18 . 1 

YIQITYVEPYFDTYEMBODRITYFDKNYNLRRFMYCTPFTLDGRAHGELHEQFKRKTILTT 

YIQITFVEPYFDEYEMBa)RVTYFEKNFNLRRFMYTTPFTLEGRPRGELHEQYRRNTVLTT 
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rat 

HC4 

HCl 

HC3 
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HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



Coiled-Coil 1 ■ 

YVKKRIPVMYQHHTOI^PtEVAIDEMSKKV'AELRQIiCSSAEVDMIKIiQ 



Ref 



IHCFPY 

IHCFPYVKKRIPVMYQHHTDIi^pfcEVAIDEMSKKVAEIJlQLCSSAEVDMI 
IHCFPYVKKRI PVMYQHHTDLNP CEVAIDEMSKKVAELHQLCS S AEVDMIKLQLKLQG SV 
SNSFPYVKKRIPINCEQQINLKP CDGATDEIKDKTAELQKLCSSTDVDMIQLQLKLQG ^ 
SHLFPYVKKRIQVI SQS STELNP EEVAIDEMSRKVSEI^QLCTMEEVDMI SLQLKLQG SV 
SHAFPYIKTRVNVTHKEt IIIiTP CEVAIEDMQKKTQELAFATHQDPADPKMLQMVLQG SV 
MHAFPYIKTRISVIQKEEFVLTP [EVAIEDMKKKTLQIAVAINQEPPDAKMLQMVLQG|SV 



11.1 



Coiled-Coil 2 



SVQVnAGPLAYARAFIJ^DTNTKRYPDNKVKLIiKEVFRQFVEACGQ.?!^ 

svqvnagpiayarafijddtntkrypdnkvkllkevfrqfveacgq;^ iavnerlikedqle 
svqvnagplayaraflddtntkrypdnkvkllkevfrqfveacgq^ lavnerlikedqle 

SVQVNAGPLAYARAFIJm SQASKYPPKKVSELKDMFRKFIQAC S li^ LEIOTRLIKE^^ 
SVKVNAGPMAYARAFLEETNAKKYPDNQVKLLKEIFRQFADACGQA ldvnerlikedqle 

gttvnqgplevaqvflseipsdpklfrhhnkiju:.cfkdftkrced;^lrknksl 

/^yvrp.ncrQr.pT.ir^72inwrAV.TPAnPKI.YRHHNKI.RLCFKEFIt^ 



11.1/12.1 



Coiled-Coil 2 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



-VLPNSLHIFNAISGTPTSTMVHGMTSS 



I yqeemkanyremakelse imheqk :pleekts 

YQEEMKAfcTYREMAKEIiSE IMHEQIi( ; 

YQEEMKANYRE IRKELSD I IVPRK :PGEDKRATKFPAHLQRHQRDTNKHSGS: 
YHEGIiKSNFRDMVKELSDIIHEQi: .QEDTMHSPWMSNTLHVFCAISGTSSDRi 

yqeelrshykdmlselstvmi|ieqi! :grddlsk 

YQRELG KLSS i ?Z 

YQQEIJCKNYNKLKEinJlPMIERKI^'ELYKPIFRVESQKRPSFHRSSFRKCETQLSQGSZ 



pFIIiS 
[YqSPRYA 

RGVDQTCTRVISKATPALPTVSISS 19.1 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



PBM 

Issw^ - 



CVTLPHEPHVGTCFVMCKLRTTFRANHWFCQAQEEAMGNGREKEPWTVEFNSRFYRSWGK 

ivb— ""IIIIIIII" 

ISAEV^ " " " 



HC2A 

KIAA 

rat VHIFF 

HC4 

HCl 

HC3 

HC5 
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genomic DNA 

FIG. 7 



BAG 6 DNA 
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I 10 I 20 I 30 I 40 

1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTG6 CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATG6 AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT 6CTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGG2^GC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTT6A TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTT6 GCATTGGGGA 
2721 GCCAACAGT6 ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA A6GAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGA6 AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 T6AGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAA6GA 
3 921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CT6CCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGG6GAAG GGGAGAGAAA GGAAATAAAG 
4321 A6AAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 

I 10 I 20 I 30 I 40 



I 50 1 60 I 70 I 80 

GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGAT6AAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAA6AG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
^CCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC 6TGTTTAGAC AAGGATGCAC 3040 
tcStStoG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TgScTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
SLgtttga GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC 6GCCGAGGT6 GACATGATCA AACTGCAGCT 3760 
rrjiCTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
^?^G ^TTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
^LgGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
^GGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG attacatctc 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT aggagttgta 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
^tIgAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
SgGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
S^^TTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 

50 I 60 I 70 I 80 
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I 10 I 20 I 30 I 40 

1 ME6HVMIAFL PTII^NQLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTIUOPSA DFLTSNKLLR YSWFFFDVLI KSMA.QHLIEN 
161 SKNANHSIAV FIKRCFTEMD RGFVFKQINN YISCFAPGDP 
241 QIJ5YSLTDEF CRNHFLVGIi LREVGTALQE FREVRLIAIS 
321 RINVFDVSPF PVNAC3MTVKD ESLALPAVNP LVTPQKGSTIi 
401 TDSGNSLPER NSEKSNSU3K HQQSSTLGNS WRCDKIiDQS 
481 EVCLHQFQYM GKRYIARNQE GLGPIVHDRK SQTLPVSBNR 
561 lATEVCLTAL DTLSLFTLAF KNQLIADHGH NPIMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFI14 W>INFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVIMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF FQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRIX 
961 FFDEKEI^ER KTEFERSHNI RREMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NEKLIKEDQL EYQEEMKANY REMAKELSEI MHEQICPIiEE 
I 10 I 20 I 30 I 40 



I 50 I 60 I 70 I 80 

VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAAE TVVNMU^HI TQKFGDNPEA 160 
KTI^YKFEF LRWCNHEHY IPLNLPMPFG KGRIQRYQDL 240 
VLKNLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDIiG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSLLMCFL YILKSMSDDA LFTYWNKAST SELMDFFTIS 480 
TGMMHARLQQ LGSLDNSLTF NHSYGHSDAD VLHQSLLEAN 560 
YLCFLQKHQS BTALKNVFTA URSLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV V6IGETRFQQ SLSIINNCAN 720 
LVDLQYSIAK SYASTPEUUC TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVLMELL EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDUJSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DIOTIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
1 50 I 60 I 70 I 80 
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I 10 I 20 I 30 I 40 

1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCT6G CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
5 61 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG OACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAG6 CAAG6ATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCawGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGGACAS ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGC6G CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTC6ACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTASTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 AC6AGGATGT GCTGATGGAG CTCCTT6AGC AGTGCGCAGA 
3201 ATCTACAIU^C TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGZACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAA6AA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
6AGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 . 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCA6GATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAA6AACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGATU^C 1520 
TGAACGC6GG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTA6 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTAT6GA TTTTTTTACA 2000 
AGCGATACAT AGCCA6G2AC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAAT6ATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGAT6TTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTXAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC AC6GAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAAC6 GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 37 60 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGG66 AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGT6C AATATGGAGA 4880 
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1 MEGHVMIAFL PTIIJ^QLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTILKPSA DFLTSNKLLR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSLAV FIKRCFTFMD RGFVFKQIKN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGLL LREVGTAI^E FREVRLIAIS 
321 RINVBDVSPF PVNAGMTVKD ESIALPAVNP LVTPQKGSTI. 
401 TDSGNSLPER NSEKSNSIJ3K HQQSSTLGNS WRCDKLDQS 
481 EVCLHQFQYM GKRYIABNQE GLGPIVHDRK SQTLPVSRNR 
561 lATEVCLTAL DTLSLFTLAF KNQUADHGH NPIiMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVLMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKBPKLT PLSEISQRLL 
961 FFDEKELQER KTEFERSHNI RRIMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKI^LKLQGS VSVQVNAGPL 
1121 NERLIKEDQL EYQEEMKANY REMAKELSEI MHEQICPLEE 
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VADCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLUWQR FPASYHHAAE. TWNMtMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPUILPMPFG KGRIQRYQDL 240 
VLKNLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSIXMCFL YILKSMSDDA LFTYWNKAST SEI24DFFTIS 480 
TGMMHARLQQ LGSLDNSLTF NHSYGHSDAD VLHQSLLEAN 560. 
YLCFI^KHQS ETALKNVFTA IMLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
LVDLQYSLAK SYASTPELRK TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVIWELL EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDIJ3SKY AYIQVTHVIP 9 60 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCG6CG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACC6 TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 6T6TAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGAC6CCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CA6CCTTCCA GAAAfiGAATA 
1841 CACATTGGGA AATTCCGTG6 TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AJU^WSTTTTT 
2401 CGGCTTTAAA AAAT6TCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGC66 CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA 6CTGATAGCA 6ACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTC6GAAAT TTCTCAGAGA 
3361 TCA2VAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCG6TAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCA;CTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT agacactcaa 
4721 tcatcgactc cgggacggtc atatatgtat tacatttcta 
4801 tgtgataaat ttgtgctggt ccagtatatg caatacactt 
4881 tdtatacaag tctttact 
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gccccccctc gaggtcgacg gtatcgataa GCTTGATATC 80 

TTTTATGATG AGATTAAAAT. AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GZ^GCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTl'GTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA T6CCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCA6ATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAA6A ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAAT6GCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCT6G AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCAT6TT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGGAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCC6AGGTG 6ACATGATCA AACTGCAGCT 37 60 
CCACTAGCAT ATGCGC6AGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACT6GGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
j^TTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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1 MEGHVMIAFL PTII^^QLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTILKPSA DFLTSNKLLR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSIAV FIKRCFTFMD RGFVFKQINN YISCFAP6DP 
241 QLDYSLTDEF CRNHFLVGLL LREVGTALQE FREVBI.IAIS 
321 RXNVKDVSPF PVNAGMTVKD ESIALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS WRCDKLDQS 
481 EVCLHQFQYM GKRYIARNQE GLGPIVHDRK SQTLPVSRNR 
561 lATEVCLTJ^ DTLSLFTLRF KNQLIADHGH NPLMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDKLIKHTSF SSDVKDLTKR IRTVl^IATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVT TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDEKELQER KTEFERSHNI BRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVC2VNAGPL 
1121 NERLIKEDQL EYQEEMKANY FEMZOCELSEI MHEQICPIiSE 
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VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAM. TWNMLMPHI TQKFGDNPEA 160 
KTIaFEYKFEF MWVCNHEHY IPLNLPMPFG KGRIQRYQDL 240 
VLKNLLIKHS FDDRYASRSH QZ^IATLYLP LFGLLIENVQ 320 
DNSLHKDliG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSLIMCFL YILKSMSDDA LFTYWNKAST SEIMDFFTIS 480 
TGMMHAKLQQ LGSLDNSLTF NHSYGHSDAD VLHQSLLEAN 560 
YLCFLQKHQS ETALKNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
LVDLQYSLAK SYASTPEU<K TWUDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVM:I*L EQCM5GLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDS6K VNPKDU5SKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAGGTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT 6GCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGA6GATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAASGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCA6 GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
•4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCC.C 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTT6GAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCT6 GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
ScSiACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
^S^S ^CTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
StGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
atSggaaga AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA C6AGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGAT6ACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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1 MEGHVMIAFL PTII^NQLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTILKPSA DFLTSNKIiliR YSWFFFDVLI KSMAQHLIEN 
161 SKKANHSIAV FIKRCFTFMD RGFVFKQINN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGLL LREVGTAI^E FREVKLIAIS 
321 RINVRDVSPF PVNAGblTVKD ESIALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS VVRCDKU>2S 
481 EVCLHQFQYM GKRYIABNQE GLGPIVHDRK SQTLPVSRNR 
561 lATEVCLTAL DTLSLFTLAF KNQLLADHGH NPI14KKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFUXl RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVLMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRIiL 
961 FFDEKELQER KTEFERSHNI RRIMFEMPFT QTGKRQGGVB 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NERLIKEDQL EYQEEMKANY REMAKELSEI MHEQICPLEE 
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VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLPNQR FPASYHHAAE. TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF UlWCNHEHY IPLNLPMPFG KGRIQRYQDL 240 
VLKNI.LIKHS FDDRYASRSH QARIATLYI^P UGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSLI24CFL YILKSMSDDA LFTYWNKAST SEIMDFFTIS 480 
TGMMHARLQQ LGSUDNSLTF NHSYGHSDAD VLHQSIiEAN 560 
YliCFLQKHQS EXALKNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHlrQVI ISVSQLIADV VGIGBTRFQQ SLSIINNCAN 720 
LVDLQYSLAK SYASTPEUOC TWLDSMARIH VKNGDLSEAA 800 
MEDVGM2DVH FNEDVLMELL EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVIoaQDSGK VNPKDUDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KKLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS. SSW 1195 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG A'GGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 

561 CATGCGATGG aaggccacgt gatgatcgcc ttcttgccca 
641 acaggaagaa gtcgcggtta acgtgactcg ggtcattatt 
721 acttgaggtc atatgttaag tacgcgtata aggctgagcc 

801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGA6GTCCG TCT6ATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCisCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAA6CACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGZ^GTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC A6GAA6ACGT GGCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGG6 
3601 TACACTGCTT CCCTTAT6TG AAGZUVGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA 6GTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG 6GCAGCGTGA 6TGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGJ^GGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCA6GGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG. GTATCGATAA GCTTGATATC 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATG6ACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
C2M2CCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAA6TAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGAT6TGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTAT6GA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGA6GGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCA6TCCC T6TCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360. 
TCT6GATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TT6TCATTAA 4720 
C^TTTTTAAT ACTCACATGG GCTTAT6CAT TAAGTTTAAT 4800 
T/IATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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1 MEGHVMIAFL PTILNQLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTIIiKPSA DFLTSNKIiR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSIAV FIKRCFTEMD RGFVFKQINN YISCFAPGDP 
241 QIJ>YSI.TDEF CRNHFLVGLL LREVGTALQE FREVKLIAIS 
321 RINVBDVSPF PVNAGMTVKD ESLALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSIX>K HQQSSTLGNS WRCDKLDQS 
481 EVCLHQFQYM GKRYIARNQE GLGPIVHDRK SQTLPVSRNR 
561 lATEVCLTAL DTLSLFTLAF KNQKLADHGH NPIMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFIll RNNFDYTGKK 
721 SDRIilKHTSF SSDVKDLTKR IRTVIiMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDEKELQER KTEFERSHNI RRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NERLIKEDQL EYQEEMKANY REMZ^LSEI MHEQICPLEE 
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VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLIiRNQR FPASYHHAAE TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPLNLPMPFG KGRIQRYQDI. 240 
VLKNLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPNINSVR HADSRGSLIS 400 
EIKSLLMCFL YILKSMSDDA LFTYWNKAST SEtMDFFTIS 480 
TGMMHARLQQ LGSIJ3NSLTF NHSYGHSDAD VLHQSIiEAN 560 
YLCFLQKHQS ETALKNVFTA IMLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
LVDLQYSIAK SYASTPELRK TWLDSMT^IH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVIlffiU:. EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTC6GCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CA6GAGTTCC GGGAGGTCCG TCT6ATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAA6GATAGC 
1521 GTCCAGCGGA TCAATGTGAG 6GATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA 6AAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC T6TTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 GCCAACATTG CTACTGAGGT TTGCCTGACA 6CTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGC6G CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA A6GAGCATGA GAACGACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT 6GCTC6ACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTC7VGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGG AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAA6AG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCT6TCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCT6ACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA 6GAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTT6AGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TAXIIATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG. GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG 6AAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
6ACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGT6C TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCA6ATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTG6 TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTG6T TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CAGAGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
6TGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGT6G AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAAT6TGC AATATGGAGA 4880 
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1 MEGHVMIAFL PTimQLFRV LTRATQEEVA VNVTKVIIHV 
81 SMTTILKPSA DFLTSNKLLR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSIAV FIKRCFTEMD RGFVFKQINN YISCFAPGDP 
241 QLDYSIiTDEF CRNHFLVGIi UIEVGTALQE FREVRLIAIS 
321 RINVRDVSPF PVNAGMTVKD ESLALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSUDK HQQSSTLGNS WRCDKLDQS 
481 EVCLHQFQYM GKRYIARNQE GLGPIVHDRK SQTLPVSRNR 
561 lATEVCIiTAIi DTLSLFTLAF KNQLLADHGH NPLMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFm RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVIMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVI TPNIDEE2VSM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDEKEIiQER KTEFERSHNI RRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NEKLIKEDQL EYQEEMKANY KEMAKELSEI MHEQICPLEE 
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VAQCHEEGLE SHLRSYVKYA . YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAAE IWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPLNLPMPFG KGRIQRYQDL 240 
VLKNLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSIiliMCFL YILKSMSDDA LFTYWNKAST SELMDFFTIS 480 
TGbdMHARLQQ LGSLDNSLTF NHSYGHSDAD VLHQSIiLEAN 560 
YLCFLQKHQS ETALKNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHIiQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
LVDLQYSIAK SYASTPELRK TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVLMELL EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMXQHHT DI^IEVAID 1040 
AYARAFIiDDT NTKRYPDNKV KLLKEVFRQF VEACQQALAV 1120 
KTSVIiPNSLH IFNAIS6TPT STMVHGMTSS SSW 1195 
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1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA CGAGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGC66TTA ACGTGACTCG GGTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGAG6TCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG 6GATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 -CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCT6CA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC T6TTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 6CCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCC6ACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAZ^GACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGACGT GGCTCGACAG 
2961 6CAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CAAACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTCAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGA6A 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGAC6AAAA AGAGTT6CAA GAAAGGAAAA 
3521 TGAGATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCG6G 
3601 TACACTGCTT CCCTTATGTG AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA TGAGTAAGAA GGTGGC6GAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA GTGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA 6GAAATAAAG 
4321 AG2^AGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG' GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAGAACTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGTGC TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAAGTAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGT6T TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTAT6GA TTTTTTTACA 2000 
AGCGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
A2^GAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTC6GA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCAGGT GACTCACGTC 3440 
CA6AGTTTGA 6AGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TGAACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAA6CCTTA 3920 
ATCAGGAAGA AATGAAAGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGC6TCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTGGGG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CATTTTTAAT ACTCACATGG GCTTATGCAT TAAGTTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 
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1 MEGHVMIAFL PTIIiNOLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTII.KPSA DFLTSNKLUl YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSIAV FIKRCFTFMD RGFVFKQINN YISCFAPGDP 
241 QUDYSLTDEF CRNHFLVGLL UIEVGTALQE FKEVKLIAIS 
321 RINVRDVSPF PVNAGMTVKD ESIALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS WRCDKLDQS 
481 EVCLHQPQYM GKRYIABNQE GLGPIVHDRK SQTLFVSRNR 
561 lATEVCLTAL DTLSLFTIAF KNQLLADHGH NPIMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVtMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKCVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRUL 
961 FFDEKELQER KTEFERSHNI RRFMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKUiLKLQGS VSVQVNAGPL 
1121 NERLIKEDQL EYQEEMKANY REMJUOZLSEI MHEQICPI^E 
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VAQCHEEGLE SHUISYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAAE.. TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPLNLPMPFG KGRIQRYQDL 240 
VLKNLLIKHS FDDRYASRSH QZVRIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPNINSVR NADSRGSLIS 400 
EIKSLLMCFL YILKSMSDDA LFTYWNKAST SEIMDFFTIS 480 
TGMMHARIiQQ LGSLDNSLTF NHSYGHSDAD VLHQSI*LEAN 560 
YLCFI^KHQS ETALKNVFTA UlSLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
LVDLQYSIAK SYASTPEIiRK TWLDSMARIH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVI24ELL EQCADGLWKA ERYELIADIY 880 
KLYSDKFGSE NVKMIQDSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DIOTIEVAID 1040 
AYARAFLDDT NTKRYPDNKV KLLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
I 50 I 60 I 70 I 80 



FIG. 10G (cont.) 



86/87 



I 10 I 20 I 30 I 40 

1 AATTGTAATA CGACTCACTA TAGGGCGAAT TGGGTACCGG 
81 GAATTCGGCA C6AGTTTTAC ACCATCACCA AAACCCAGAA 
161 ATGAAAAGCA CCACCTGTTG CTCACATTCT TCCATGTCAG 
241 GTCGTTGAAA CCCAAGTTGG CTACTCCTGG CTTCCCCTCC 
321 GGTCTCGGCG AACCTTCCTT CGGGCTATCT TGGCTACCAA 
401 GGGTAGATGG AGGCAAGCCA CTGCTGAAAA TTTCCACTCA 
481 ATTTTTTCCA GTACTGTCAG AAAACCGAAT CTGGAGCCCA 
561 CATGCGATGG AAGGCCACGT GATGATCGCC TTCTTGCCCA 
641 ACAGGAAGAA GTCGCGGTTA ACGTGACTCG 6GTCATTATT 
721 ACTTGAGGTC ATATGTTAAG TACGCGTATA AGGCTGAGCC 
801 ACCAAATCCA TGACCACGAT TCTCAAGCCT TCTGCCGATT 
881 CTTTGATGTA CTGATCAAAT CTATGGCTCA GCATTTGATA 
961 CTGCATCCTA TCATCATGCA GCGGAAACCG TTGTAAATAT 
1041 GAGGCATCTA AGAACGCGAA TCATAGCCTT GCTGTCTTCA 
1121 CAAGCAGATC AACAACTACA TTAGCTGTTT TGCTCCTGGA 
1201 GTGTAGTGTG CAACCATGAA CATTATATTC CGTTGAACTT 
1281 GACCTCCAGC TTGACTACTC ATTAACAGAT GAGTTCTGCA 
1361 GACAGCCCTC CAGGAGTTCC GGGA6GTCCG TCTGATCGCC 
1441 ATGACAGATA TGCTTCAAGG AGCCATCAGG CAAGGATAGC 
1521 GTCCAGCGGA TCAATGTGAG GGATGTGTCA CCCTTCCCTG 
1601 ACCAGCTGTG AATCCGCTGG TGACGCCGCA GAAGGGAAGC 
1681 TCTCCGGCAT TGCTTCTCCA TATACAACCT CAACTCCAAA 
1761 ATAAGCACAG ATTCGGGTAA CAGCCTTCCA GAAAGGAATA 
1841 CACATTGGGA AATTCCGTGG TTCGCTGTGA TAAACTTGAC 
1921 TCTTAAAGAG CATGTCTGAT GATGCTTTGT TTACATATTG 
2001 ATATCTGAAG TCTGCCTGCA CCAGTTCCAG TACATGGGGA 
2081 AGTTCATGAT CGAAAGTCTC AGACATTGCC TGTTTCCCGT 
2161 GCAGCCTGGA TAACTCTCTC ACTTTTAACC ACAGCTATGG 
2241 6CCAACATTG CTACTGAGGT TTGCCTGACA GCTCTGGACA 
2321 GGCCGACCAT GGACATAATC CTCTCATGAA AAAAGTTTTT 
2401 CGGCTTTAAA AAATGTCTTC ACTGCCTTAA GGTCCTTAAT 
2481 ATGTGTGCGG CTCTGTGTTA CGAGATTCTC AAGTGCTGTA 
2561 GCTCTACTTC CTGATGAGGA ACAACTTTGA TTACACTGGA 
2641 CTGTCAGCCA GCTGATAGCA GACGTTGTTG GCATTGGGGA 
2721 GCCAACAGTG ACCGGCTTAT TAAGCACACC AGCTTCTCCT 
2801 AATGGCCACC GCCCAGATGA AGGAGCATGA GAAC6ACCCA 
2881 ATGCCAGCAC GCCCGAGCTC AGGAAGAC6T 6GCTCGACAG 
2961 GCAGCAATGT GCTATGTCCA CGTAACAGCC CTAGTGGCAG 
3041 CGCCTTCAGG GTCATTACCC CA2VACATCGA CGAGGAGGCC 
3121 ACGAGGATGT GCTGATGGAG CTCCTTGAGC AGTGCGCAGA 
3201 ATCTACAAAC TTATCATCCC CATTTATGAG AAGCGGAGGG 
3281 GGAACCCAAA CTCACACCGC TGTCGGAAAT TTCTCAGAGA 
3361 TCAAAATGAT ACAGGATTCT GGCAAGGTCA ACCCTAAGGA 
3441 ATCCCCTTCT TTGACGAAAA AGAGTTGCAA GAAAGGAAAA 
3521 TG2«5ATGCCA TTTACGCAGA CCGGGAAGAG GCAGGGCGGG 
3601 TACACTGCTT CCCTTATGT6 AAGAAGCGCA TCCCTGTCAT 
3681 ATTGACGAGA T6AGTAAGAA GGTGGCGGAG CTCCGGCAGC 
3761 CAAACTCCAG GGCAGCGTGA 6TGTTCAGGT CAATGCTGGC 
3841 CAAAGCGATA TCCTGACAAT AAAGTGAAGC TGCTTAAGGA 
3921 GCGGTAAACG AACGTCTGAT TAAAGAAGAC CAGCTCGAGT 
4001 GGAGCTTTCT GAAATCATGC ATGAGCAGAT CTGCCCCCTG 
4081 TCAACGCCAT CAGTGGGACT CCAACAAGCA CAATGGTTCA 
4161 ATGGCCCGTG TGTGGGGACT TGCTTTGTCA TTTGCAAACT 
4241 CAGGGAGGAC CAAGGGGAAG GGGAGAGAAA GGAAATAAAG 
4321 AGAAGGTGCA CATATTTTTT TAAATCTCAC TGGCAATATT 
4401 TCTTGAGCTG GACTTAGATT TTATTCTTCC TTGCAGAGTA 
4481 ATCTACATGG CAGGGAGGGC TGCACTGACA TTGATGCCTG 
4561 AATCAGGGTA CAGAACTTAC TAGTTTTGTC TAGGAGTATG 
4 641 ATAGAGCAAG AATAGTGAGC TAACTGAGCT AGACACTCAA 
4721 TCATCGACTC CGGGACGGTC ATATATGTAT TACATTTCTA 
4801 TGTGATAAAT TTGTGCTGGT CCAGTATATG CAATACACTT 
4881 TGTATACAAG TCTTTACT 
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GCCCCCCCTC GAGGTCGACG. GTATCGATAA GCTTGATATC 80 
TTTTATGATG AGATTAAAAT AGAGTTGCCC ACTCAGCTGC 160 
CTGTGACAAC TCAAGTAAAG GAAGCACGAA GAAGAGGGAT 240 
TGAAAGACGG AAGGGTGGTG ACAAGCGAGC AGCACATCCC 320 
GAGCTTGGGA TGGGCAGGCA TTATGGTCCG GAAATTAAAT 400 
TCTGGTTTCT ACAGGGATAC TCAGGATCAG CATTTACATA 480 
AGCCTTAGGA AACGAACTTG TAAAGTACCT TAAGAGTCTG 560 
CTATCCTAAA CCAGCTGTTC CGAGTCCTCA CCAGAGCCAC 640 
CATGTGGTTG CCCAGTGCCA TGAGGAAGGA TTGGAGAGCC 720 
ATATGTTGCC TCTGAATACA AGACAGTGCA TGAAGAACTG 800 
TCCTCACCAG CAACAAACTA CTGAGGTACT CATGGTTTTT 880 
GAG2^CTCCA AAGTTAAGTT GCTGCGAAAC CAGAGATTTC 960 
GCTGATGCCA CACATCACTC AGAAGTTTGG AGATAATCCA 1040 
TCAAGAGATG TTTCACCTTC ATGGACAGGG GCTTTGTCTT 1120 
GACCCAAAGA CCCTCTTTGA ATACAAGTTT GAATTTCTCC 1200 
ACCAATGCCA TTTGGAAAAG GCAGGATTCA AAGATACCAA 1280 
GAAACCACTT CTTGGTGGGA CTGTTACTGA GGGAGGTGGG 1360 
ATCAGTGT6C TCAAGAACCT GCTGATAAAG CATTCTTTTG 1440 
CACCCTCTAC CTGCCTCTGT TTGGTCTGCT GATTGAAAAC 1520 
TGAACGCGGG CATGACCGTG AAGGATGAAT CCCTGGCTCT 1600 
ACCCTGGACA ACAGCCTGCA CAAGGACCTG CTGGGCGCCA 1680 
CATCAACAGT GTGAGAAATG CTGATTCGAG AGGATCTCTC 1760 
GTGAGAAGAG CAATTCCCTG GATAAGCACC AACAAA6TAG 1840 
CAGTCTGAGA TTAAGAGCCT ACTGATGTGT TTCCTCTACA 1920 
GAACAAGGCT TCAACATCTG AACTTATGGA TTTTTTTACA 2000 
A6CGATACAT AGCCAGGAAC CAGGAGGGGT TGGGACCCAT 2080 
AACAGAACAG GAATGATGCA TGCCAGATTG CAGCAGCTGG 2160 
CCACTCGGAC GCAGATGTTC TGCACCAGTC ATTACTTGAA 2240 
CGCTTTCTCT ATTTACATTG GCGTTTAAGA ACCAGCTCCT 2320 
GATGTCTACC TGTGTTTTCT TCAAAAACAT CAGTCTGAAA 2400 
TTATAAGTTT CCCTCAACAT TCTATGAAGG GAGAGCGGAC 2480 
ACTCCAAGCT GAGCTCCATC AGGACGGAGG CCTCCCAGCT 2560 
AAGAAGTCCT TTGTCCGGAC ACATTTGCAA GTCATCATAT 2640 
AACCAGATTC CAGCAGTCCC TGTCCATCAT CAACAACTGT 2720 
CTGATGTGAA GGACTTAACC AAAAGGATAC GCACGGTGCT 2800 
GAGATGCTGG TGGACCTCCA GTACAGCCTG GCCAAATCCT 2880 
CATGGCCAGG ATCCATGTCA AAAATGGCGA TCTCTCAGAG 2960 
AATATCTCAC ACGGAAAGGC GTGTTTAGAC AAGGATGCAC 3040 
TCCATGATGG AAGACGTGGG GATGCAGGAT GTCCATTTCA 3120 
TGGACTCTGG AAAGCCGAGC GCTACGAGCT CATCGCCGAC 3200 
ATTTCTTTGA AGATGAAGAT GGAAAGGAGT ATATTTACAA 3280 
CTCCTTAAAC TGTACTCGGA TAAATTTGGT TCTGAAAATG 3360 
TCTGGATTCT AAGTATGCAT ACATCCA6GT GACTCACGTC 3440 
CA6AGTTTGA GAGATCCCAC AACATCCGCC GCTTCATGTT 3520 
GTGGAAGAGC AGTGCAAACG GCGCACCATC CTGACAGCCA 3600 
GTACCAGCAC CACACTGACC TG7ACCCCAT CGAGGTGGCC 3680 
TGTGCTCCTC GGCCGAGGTG GACATGATCA AACTGCAGCT 3760 
CCACTAGCAT ATGCGCGAGC TTTCTTAGAT GATACAAACA 3840 
AGTTTTCAGG CAATTTGTGG AAGCTTGCGG TCAAGCCTTA 3920 
ATCAGGAAGA AATGAA2VGCC AACTACAGGG AAATGGCGAA 4000 
GAGGAGAAGA CGAGCGTCTT ACCGAATTCC CTTCACATCT 4080 
CGGGATGACC AGCTCGTCTT CGGTCGTGTG ATTACATCTC 4160 
CAGGATGCTT TCCAAAGCCA ATCACTG6GG AGACCGAGCA 4240 
AACAACGTTA TTTCTTAACA GACTTTCTAT AGGAGTTGTA 4320 
CAAAGTTTTC ATTGTGTCTT AACAAAGGTG TGGTAGACAC 4400 
GTGTTAGAAT AGATGGCCTA CAGAAAAAAA AGGTTCTGGG 4480 
GGGGACCTTT TGCCTCGACT CGTGCCGGAA ATCTGATCGT 4560 
TTGTATGACT AGGATTTGTG CTATTATCTC ATTCAACAAC 4640 
TTAATCCGCT ACTGGCTTCA AGTCAGAACT TTGTCATTAA 4720 
CK'I'ST'SlM^'i: ACTCACATGG GCTTATGCAT TAA6TTTAAT 4800 
TAATGGTTTA TTCTTGTCAT AAAAATGTGC AATATGGAGA 4880 

4898 
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1 MEGHVMIAFL PTItNQLFRV LTRATQEEVA VNVTRVIIHV 
81 SMTTILKPSA DFLTSNKLLR YSWFFFDVLI KSMAQHLIEN 
161 SKNANHSLAV FIKRCFTFMD RGEVFKQINN YISCFAPGDP 
241 QLDYSLTDEF CRNHFLVGLL I*REVGTAI<2E FKEVRLIAIS 
321 RINVBDVSPF PVNAGMTVKD ESLALPAVNP LVTPQKGSTL 
401 TDSGNSLPER NSEKSNSLDK HQQSSTLGNS VVRCDKIJ3QS 
481 EVCLHQFQYM GKRYIARNQE GLGPIVHDRK SQTLPVSRNR 
561 lATEVCLTAL DTLSLFTLAF KNQLLADHGH NPLMKKVFDV 
641 AALCYEILKC CNSKLSSIRT EASQLLYFLM RNNFDYTGKK 
721 SDRLIKHTSF SSDVKDLTKR IRTVLMATAQ MKEHENDPEM 
801 MCYVHVTALV AEYLTRKGVF RQGCTAFRVI TPNIDEEASM 
881 KLIIPIYEKR RDFFEDEDGK EYIYKEPKLT PLSEISQRLL 
961 FFDEKELQER KTEFERSHNI RREMFEMPFT QTGKRQGGVE 
1041 EMSKKVAELR QLCSSAEVDM IKLQLKLQGS VSVQVNAGPL 
1121 NEKLIKEDQI. EYQEEMKANY REMAKELSEI MHEQICPLEE 
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VAQCHEEGLE SHLRSYVKYA YKAEPYVASE YKTVHEELTK 80 
SKVKLLRNQR FPASYHHAAE ..TWNMLMPHI TQKFGDNPEA 160 
KTLFEYKFEF LRWCNHEHY IPI^PMPFG KGRIQRYQDL 240 
VLKNLLIKHS FDDRYASRSH QARIATLYLP LFGLLIENVQ 320 
DNSLHKDLLG AISGIASPYT TSTPNINSVR N2U3SRGSLIS 400 
EIKSLI24CFL YILKSMSDDA LFTYWNKAST SEI2ffi>FFTIS 480 
TGMMHARLQQ LGSLDNSLTF NHSYGHSDAD VLHQSIiEAN 560 
YLCFLQKHQS ETALKNVFTA LRSLIYKFPS TFYEGRADMC 640 
SFVRTHLQVI ISVSQLIADV VGIGETRFQQ SLSIINNCAN 720 
LVDLQYSIiAK SYASTPEIiRK TWLDSMZ^IH VKNGDLSEAA 800 
MEDVGMQDVH FNEDVIMELL EQCADGI.WKA ERYELIADIY 880 
KLYSDKFGSE NVKhCEQDSGK VNPKDLDSKY AYIQVTHVIP 960 
EQCKRRTILT AIHCFPYVKK RIPVMYQHHT DLNPIEVAID 1040 
AYARAFUDDT NTKRYPDNKV KLLKEVFRQF VEACGQALAV 1120 
KTSVLPNSLH IFNAISGTPT STMVHGMTSS SSW 1195 
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